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Preface 



Adaptive control has been a remarkable field for industrial and academic research since 
1950s. Since more and more adaptive algorithms are applied in various control applications, 
it is considered as important for practical implementation. As it can be confirmed from the 
increasing number of conferences and journals on adaptive control topics, it is certain that 
the adaptive control is a significant guidance for technology development. 

Also adaptive control has been believed as a breakthrough for realization of intelligent 
control systems. Even with the parametric and model uncertainties, adaptive control enables 
the control system to monitor the time varying changes and manipulate the controller for 
desired performance. Therefore adaptive control has been considered to be essential for time 
varying multivariable systems. Moreover, now with the advent of high-speed microproces- 
sors, it is possible to implement the innovative adaptive algorithms even in real time situa- 
tion. 

With the efforts of many control researchers, the adaptive control field is abundant in 
mathematical analysis, programming tools, and implementational algorithms. The authors 
of each chapter in this book are the professionals in their areas. The results in the book 
introduce their recent research results and provide new idea for improved performance in 
various control application problems. 

The book is organized in the following way. There are 16 chapters discussing the issues 
of adaptive control application to model generation, adaptive estimation, output regulation 
and feedback, electrical drives, optical communication, neural estimator, simulation and 
implementation: 

Chapter One: Automatic 3D Model Generation based on a Matching of Adaptive 
Control Points, by N. Lee, J. Lee, G. Kim, and H. Choi 

Chapter Two: Adaptive Estimation and Control for Systems with Parametric and 
Nonparametric Uncertainties, by H. Ma and K. Lum 

Chapter Three: Adaptive Output Regulation of Unknown Linear Systems with 
Unknown Exosystems, by I. Mizumoto and Z. Iwai 

Chapter Four: Output Feedback Direct Adaptive Control for a Two-Link Flexible 
Robot Subject to Parameter Changes, by S. Ozcelik and E. Miranda 

Chapter Five: Discrete Model Matching Adaptive Control for Potentially In- 
versely Non-Stable Continuous-Time Plants by Using Multirate Sampling, by S. 
Alonso-Quesada and M. De la Sen 

Chapter Six: Hybrid Schemes for Adaptive Control Strategies, by R. Ribeiro and K. 
Queiroz 



VI 



Chapter Seven: Adaptive Control for Systems with Randomly Missing Measure- 
ments in a Network Environment, by Y. Shi and H. Fang 

Chapter Eight: Adaptive Control based on Neural Network, by S. Wei, Z. Lujin, Z. 
Jinhai, and M. Siyi 

Chapter Nine: Adaptive Control of the Electrical Drives with the Elastic Coupling 
using Kalman Filter, by K. Szabat and T. Orlowska-Kowalska 

Chapter Ten: Adaptive Control of Dynamic Systems with Sandwiched Hysteresis 
based on Neural Estimator, by Y. Tan, R. Dong, and X. Zhao 

Chapter Eleven: High-Speed Adaptive Control Technique based on Steepest De- 
scent Method for Adaptive Chromatic Dispersion Compensation in Optical Com- 
munications, by K. Tanizawa and A. Hirose 

Chapter Twelve: Adaptive Control of Piezoelectric Actuators with Unknown Hys- 
teresis, by W. Xie, J. Fu, H. Yao, and C. Su 

Chapter Thirteen: On the Adaptive Tracking Control of 3-D Overhead Crane Sys- 
tems 

Chapter Fourteen: Adaptive Inverse Optimal Control of a Magnetic Levitation 
System, by Y. Satoh, H. Nakamura, H. Katayama, and H. Nishitani 

Chapter Fifteen: Adaptive Precision Geolocation Algorithm with Multiple Model 
Uncertainties, by W. Sung and K. You 

Chapter Sixteen: Adaptive Control for a Class of Non-affine Nonlinear Systems 
via Neural Networks, by Z. Tong 

We expect that the readers have taken a basic course in automatic control, linear systems, 
and sampled data systems. This book is tried to be written in a self-contained way for better 
understanding. Since this book introduces the development and recent progress of the 
theory and application of adaptive control research, it is useful as a reference especially for 
industrial engineers, graduate students in advanced study, and the researchers who are re- 
lated in adaptive control field such as electrical, aeronautical, and mechanical engineering. 



Kwanho You 

Sungkyunkwan University, Korea 
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Abstract 

The use of a 3D model helps to diagnosis and accurately locate a disease where it is neither 
available, nor can be exactly measured in a 2D image. Therefore, highly accurate software 
for a 3D model of vessel is required for an accurate diagnosis of patients. We have generated 
standard vessel because the shape of the arterial is different for each individual vessel, 
where the standard vessel can be adjusted to suit individual vessel. In this paper, we 
propose a new approach for an automatic 3D model generation based on a matching of 
adaptive control points. The proposed method is carried out in three steps. First, standard 
and individual vessels are acquired. The standard vessel is acquired by a 3D model 
projection, while the individual vessel of the first segmented vessel bifurcation is obtained. 
Second is matching the corresponding control points between the standard and individual 
vessels, where a set of control and corner points are automatically extracted using the Harris 
corner detector. If control points exist between corner points in an individual vessel, it is 
adaptively interpolated in the corresponding standard vessel which is proportional to the 
distance ratio. And then, the control points of corresponding individual vessel match with 
those control points of standard vessel. Finally, we apply warping on the standard vessel to 
suit the individual vessel using the TPS (Thin Plate Spline) interpolation function. For 
experiments, we used angiograms of various patients from a coronary angiography in 
Sanggye Paik Hospital. 

Keywords: Coronary angiography, adaptive control point, standard vessel, individual 
vessel, vessel warping. 

1. Introduction 

X-ray angiography is the most frequently used imaging modality to diagnose coronary 
artery diseases and to assess their severity. Traditionally, this assessment is performed 
directly from the angiograms, and thus, can suffer from viewpoint orientation dependence 
and lack of precision of quantitative measures due to magnification factor uncertainty 
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(Messenger et al., 2000), (Lee et al, 2006) and (Lee et al, 2007). 3D model is provided to 
display the morphology of vessel malformations such as stenoses, arteriovenous 
malformations and aneurysms (Holger et al, 2005). Consequently, accurate software for a 
3D model of a coronary tree is required for an accurate diagnosis of patients. It could lead to 
a fast diagnosis and make it more accurate in an ambiguous condition. 

In this paper, we present an automatic 3D model generation based on a matching of 
adaptive control points. Fig. 1 shows the overall flow of the proposed method for the 3D 
modelling of the individual vessel. The proposed method is composed as the following 
three steps: image acquisition, matching of the adaptive control points and the vessel 
warping. In Section 2, the acquisitions of the input image in standard and individual vessels 
are described. Section 3 presents the matching of the corresponding control points between 
the standard and individual vessels. Section 4 describes the 3D modelling of the individual 
vessel which is performed through a vessel warping with the corresponding control points. 
Experimental results of the vessel transformation are given in Section 5. Finally, we present 
the conclusion in Section 6. 




Fig. 1. Overall flow of the system configuration 
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2. Image Acquisition 

We have generated a standard vessel because the shape of the arterial is different for each 
individual vessel, where the standard vessel can be adjusted to suit the individual vessel 
(Chalopin et aL, 2001), (Lee et al., 2006) and (Lee et aL, 2007). The proposed approach is 
based on a 3D model of standard vessel which is built from a database that implemented a 
Korean vascular system (Lee et aL, 2006). 

We have limited the scope of the main arteries for the 3D model of the standard vessel as 
depicted in Fig. 2. 




Fig. 2. Vessel scope of the database for the 3D model of the standard vessel 

Table 1 shows the database of the coronary artery of Lt. main (Left Main Coronary Artery), 
LAD (Left Anterior Descending) and LCX (Left Circumflex artery) information. This 
database consists of 40 people with mixed gender information. 





age 


Lt. main 


LAD 


LCX 


Os 


distal 


length 


Os 


distal 


length 


Os 


distal 


length 


below 60 years of 
old (male) 


48.4±5.9 


4.3±0.4 


4.1±0.5 


9.9±4.2 


3.8±0.4 


3.6+0.4 


17.0+52 


3.5±0.4 


3.3±0.3 


192±6.1 


above 60 years of 
old (male) 


67.5±5.4 


4.5±0.5 


4.4+0.4 


8.4+3.8 


3.9±0.3 


3.6+0.3 


172+5.8 


3.6±0.4 


3.4+0.4 


24.6±8.9 


below 60 years of 
old (female) 


44.9±19.9 


3.7±1.8 


3.4+1.6 


10.6±6.2 


3.3±1.5 


3.1+L4 


14.1+5.5 


29+L3 


28±12 


21.3+92 


above 60 years of 
old (female) 


70.7±4.4 


4.3±0.7 


4.1±0.6 


12.5+7.9 


3.5±0.6 


3.4+0.5 


22.3±7.3 


3.3±ol 







am 



Table 1. Database of the coronary artery 
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To quantify the 3D model of the coronary artery, the angles of the vessel bifurcation are 
measured with references to LCX, Lt. main and LAD, as in Table 2. Ten individuals 
regardless of their gender and age were selected randomly for measuring the angles of the 
vessel bifurcation from six angiograms. The measurement results, and the average and 
standard deviations of each individual measurement are shown in Table 2. 





RAO30 
CAUD30 


RAO30 
CRA30 


AP0° 
CRA30 


LAO60 
CRA30 


LAO60 
CAUD30 


AP0° 
CAUD30 


1 


69.17 


123.31 


38.64 


61.32 


84.01 


50.98 


2 


53.58 


72.02 


23.80 


51.75 


99.73 


73.92 


3 


77.28 


97.70 


21.20 


57.72 


100.71 


71.33 


4 


94.12 


24.67 


22.38 


81.99 


75.6 


69.57 


5 


64.12 


33.25 


31.24 


40.97 


135.00 


61.87 


6 


55.34 


51.27 


41.8 


80.89 


119.84 


57.14 


7 


71.93 


79.32 


50.92 


87.72 


114.71 


58.22 


8 


67.70 


59.14 


31.84 


58.93 


92.36 


70.16 


9 


85.98 


60.85 


35.77 


54.45 


118.80 


78.93 


10 


47.39 


60.26 


34.50 


47.39 


67.52 


34.79 


Average 


68.67 


66.18 


33.21 


62.31 


100.83 


62.69 


Standard 
deviation 


14.56 


29.07 


9.32 


15.86 


21.46 


13.06 



Table 2. Measured angles of the vessel bifurcation from six angiographies 



Fig. 3 illustrates the results of the 3D model generation of the standard vessel from six 
angiographies: RAO (Right Anterior Oblique)30° CAUD (Caudal)30°, RAO30 CRA (Cranial 
Anterior)30°, AP (Anterior Posterior)0° CRA (Cranial Anterior)30°, LAO (Left Anterior 
Oblique)60° CRA30 , LAO60 CAUD30 , AP0° CAUD30 . 
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Fig. 3. 3D model generation of the standard vessel from six angiographies 



Evaluating the angles of the vessel bifurcation from six angiographies can reduce the 
possible measurement error which occurs when the angle from a single view is measured. 
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It is difficult to transform the standard vessel into individual vessel in a 3D space (Lee et al., 
2006) and (Lee et al, 2007). Therefore, we projected the 3D model of the standard vessel into 
2D projection. Fig. 4 shows the projected images of the standard vessel on a 2D plane 
through the projection. The projection result can be view as vertices or polygons based. 



ft B j4jJ'f#r , 




Fig. 4. Projection result for 2D image of standard vessel 



3. Matching of the Adaptive Control Points 

To transform a standard vessel into an individual vessel, it is important to match 
corresponding control points (Lee et al., 2006) and (Lee et al., 2007). In this paper, we 
extracted feature points of the vessel automatically and defined as control points (Lee et al., 
2006) and (Lee et al., 2007). Feature points mean is referred to the corner points of an object 
or points with higher variance brightness compared to the surrounding pixels in an image, 
which are differentiated from other points in an image. Such feature points can be defined in 
many different ways in (Parker, 1996) and (Pitas, 2000). They are sometimes defined as 
points that have a high gradient in different directions, or as points that have properties that 
do not change in spite of specific transformations. Generally feature points can be divided 
into three categories (Cizek et al., 2004). The first one uses a non-linear filter, such as the 
SUSAN corner detector proposed by Smith (Woods et al., 1993) which relates each pixel to 
an area centered by a pixel. In this area, it is called the SUSAN area; all the pixels have 
similar intensities as the center pixel. If the center pixel is a feature point (some times a 
feature point is also referred to as a "corner"), SUSAN area is the smallest one among the 
pixels around it. A SUSAN corner detector can suppress a noise effectively without 
derivating an image. The second one is based on a curvature, such as the Kitchen and 
Rosenf eld's method (Maes et al., 1997). This kind of method needs to extract edges in 
advance, and then elucidate the feature points using the information on the curvature of the 
edges. The disadvantage of this method is required more needs a complicated computation, 
e.g. curve on fitting, thus its processing speed is relatively slow. The third method is exploits 
a change of the pixel intensity. A typical one is the Harris and Stephens' method (Pluim et 
al., 2003). It produces a corner response through an eigenvalues analysis. Since it does not 
need to use a slide window explicitly, its processing speed is very fast. Accordingly, this 
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paper used the Harris corner detector to find the control points of standard and individual 
vessels (Lee et al, 2006) and (Lee, 2007). 

3.1 Extraction of the Control Points 

The Harris corner detector is a popular interest point detector due to its strong invariance 
such as rotation, scale, illumination variation and image noise (Schmid et al., 2000) and 
(Derpanis, 2004). It is based on the local auto-correlation function of a signal. The local auto- 
correlation function measures the local changes of the signal with patches shifted by a small 
amount in different directions (Derpanis, 2004). However, the Harris corner detector has a 
problem where it can mistake those non-corner points. 

Fig. 5 shows extracted 9 control points in individual vessel by using the Harris corner 
detector. We noticed that some of the extracted control points are non-corner points. To 
solve this problem of the Harris corner detector, we extracted more control points of 
individual vessel than standard vessel. Fig. 6 shows the extraction of control points from 
individual and standard vessels. 




Fig. 5. Extracted 9 control points in individual vessel 

3.2 Extraction of Corner Points 

We performed thinning by using the structural characteristics of vessel to find the corner 
points among the control points of individual vessel which is extracted with the Harris 
corner detector (Lee, 2007). Fig. 7 shows the thinning process for detection of corner points 
in individual vessel. 
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(a) Segmented vessel (b) Thinned vessel 

Fig. 6. Thinning process for detection of corner points in individual vessel 



A vascular tree can be divided into a set of elementary components, or primitives, which are 
the vascular segments, and bifurcation (Wahle et al., 1994). Using this intuitive 
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representation, it is natural to describe the coronary tree by a graph structure (Chalopin et 

al., 2001) and (Lee, 2007). 

A vascular tree of thinned vessel consists of three vertices ( v int ) and one bifurcation (bif ) 

as the following equation (1). Here, vertices ( v int ) are comprised a start point ( v start int ) 

and two end points (v end _ pointl , v endpomX2 ). 



I thin - i v point* bif } 



{v ; 



po int I y start _po mt ' 



' ^end pointl' ^end point! i 



(i) 



If the reference point is a vertex, the closest two control points to the vertex are defined as 
the corner points. If the reference point is a bifurcation, the three control points that are 
closest to it after comparing the distances between the bifurcation and all control points are 
defined as the corner points. As shown in Fig. 7, if the reference point is the vertex 
( v start point )' v i an< ^ v 2 become the corner points; if the reference point is the bifurcation 
{bif ), v 6 ,v n and v 15 become the corner points (Lee, 2007). 



I 




Fig. 7. Primitives of a vascular net 



3.3 Adaptive Interpolation of the Control Points between Corner Points 

Once the control points and corner points are extracted from an individual vessel, an 
interpolation for a standard vessel is applied. For an accurate matching, the control points 
are adaptively interpolated into the corresponding standard vessel in proportion to the 
distance ratio if there are control points between the corner points in an individual vessel 
(Lee, 2007). 

Fig. 8 shows the process of an interpolation of the control points. Control points of a 
standard vessel are adaptively interpolated by the distance rate between control point ( v 3 ) 
and two corner points ( v 2 , v 4 ) of an individual vessel. Fig. 8 (a) shows the extracted control 
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points from an individual vessel, and (b) shows an example of control point interpolated 
between a standard vessel and the corresponding corner points from (a) image. 




M 

(a) Individual vessel 



(b) Standard vessel 



Fig. 8. Interpolation of the control points for a standard vessel 

Fig. 9 shows the result of extracting the control points by using the Harris corner detector to 
the segmented vessel in the individual vessel and an adaptive interpolation of the 
corresponding the control points in the standard vessel. 
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Fig. 9. Result of an adaptive interpolation of the corresponding control points 



4. Vessel Warping 

We have warped the standard vessel with respect to the individual vessel. Given the two 
sets of corresponding control points, S = {s l s 2 , . . . s m } and / = {i x i 2 ,...i m }, the warping is applied 

the standard vessel to suit the individual vessel. Here, S is a set of control points in the 
standard vessel and / is a set of one in the individual vessel (Lee et al., 2006) and (Lee et al., 
2007). 
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Standard vessel warping was performed using the TPS (Thin-Plate-Spline) algorithm 
(Bentoutou et aL, 2002) from the two sets of control points. 

The TPS is the interpolation functions that exactly represent a distortion at each feature 
point, and for defining a minimum curvature surface between control points. A TPS 
function is a flexible transformation that allows for a rotation, translation, scaling, and 
skewing. It also allows for lines to bend according by the TPS model (Bentoutou et aL, 2002). 
Therefore, a large number of deformations can be characterized by the TPS model. 
The TPS interpolation function can be written as equation (2). 



h(x) = Ax + t + ^W i K(\\x-x i \\) (2) 



1=1 

The variables A and t are the affine transformation parameters matrices, W t are the weights 

of the non-linear radial interpolation function K , and x t are the control points. The function 

K(r) is the solution of the biharmonic equation (A K = 0) that satisfies the condition of a 

bending energy minimization, namely K(r) = r 2 log (r 2 ) . 

The complete set of parameters, the interpolating registration transformation is defined, and 

then it is used to transform the standard vessel. It should be noted that in order to be able to 

carry out the warping of the standard vessel with respect to the individual vessel, it is 

required to have a complete description of the TPS interpolation function (Lee et aL, 2006) 

and (Lee et aL, 2007). 

Fig. 10 shows the results of modifying the standard vessel to suit the individual vessel. 






(a) Individual vessel (b) Standard vessel (c) Warped vessel 

Fig. 10. Results of the warped vessel in standard vessel 

5. Results of the Vessel Transformation 

We simulated the system environment that is Microsoft Windows XP on a Pentium 3GHz, 
Intel Corp. and the compiler VC++ 6.0 is used. The image of 512x 512 is used for the 
experimentation. Each image has a gray-value resolution of 8 bits, i.e., 256 gray levels. 
Fig. 11 shows the 3D model of the standard vessel from six different angiographic views. 
The results of the standard vessel warping using TPS algorithm to suit the individual vessel 
is shown in Fig. 13. 
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Fig. 11. 3D model of the standard vessel in angiographic of six different views 
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Fig. 12. Result of standard vessel warping 
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Fig. 13 shows the result for an automatically 3D model generation of individual vessel. 
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Fig. 13. Result of 3D model generation for the individual vessel in six views 

6. Conclusion 

We proposed a fully automatic and effective algorithm to perform a 3D modelling of 
individual vessel from angiograms in six views. This approach can be used to recover the 
geometry of the main arteries. The 3D model of the vessel enables patients to visualize their 
progress and improvement for a disease. Such a model should not only enhance the level of 
reliability but also provide a fast and accurate identification. In order words, this method 
can be expected to reduce the number of misdiagnosed cases (Lee et al., 2006) and (Lee et al., 
2007). 
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Abstract 

Adaptive control has been developed for decades, and now it has become a rigorous and 
mature discipline which mainly focuses on dealing parametric uncertainties in control 
systems, especially linear parametric systems. Nonparametric uncertainties were seldom 
studied or addressed in the literature of adaptive control until new areas on exploring 
limitations and capability of feedback control emerged in recent years. Comparing with the 
approach of robust control to deal with parametric or nonparametric uncertainties, the 
approach of adaptive control can deal with relatively larger uncertainties and gain more 
flexibility to fit the unknown plant because adaptive control usually involves adaptive 
estimation algorithms which play role of "learning" in some sense. 

This chapter will introduce a new challenging topic on dealing with both parametric and 
nonparametric internal uncertainties in the same system. The existence of both two kinds of 
uncertainties makes it very difficult or even impossible to apply the traditional recursive 
identification algorithms which are designed for parametric systems. We will discuss by 
examples why conventional adaptive estimation and hence conventional adaptive control 
cannot be applied directly to deal with combination of parametric and nonparametric 
uncertainties. And we will also introduce basic ideas to handle the difficulties involved in 
the adaptive estimation problem for the system with combination of parametric and 
nonparametric uncertainties. Especially, we will propose and discuss a novel class of 
adaptive estimators, i.e. information-concentration (IC) estimators. This area is still in its infant 
stage, and more efforts are expected in the future for gainning comprehensive 
understanding to resolve challenging difficulties. 

Furthermore, we will give two concrete examples of semi-parametric adaptive control to 
demonstrate the ideas and the principles to deal with both parametric and nonparametric 
uncertainties in the plant. (1) In the first example, a simple first-order discrete-time nonlinear 
system with both kinds of internal uncertainties is investigated, where the uncertainty of 
non-parametric part is characterized by a Lipschitz constant L, and the nonlinearity of 
parametric part is characterized by an exponent index b. In this example, based on the idea 
of the IC estimator, we construct a unified adaptive controller in both cases of b = 1 and 
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b > 1, and its closed-loop stability is established under some conditions. When the 
parametric part is bilinear (b = 1), the conditions given reveal the magic number 

hV2 which appeared in previous study on capability and limitations of the feedback 

mechanism. (2) In the second example with both parametric uncertainties and non- 
parametric uncertainties, the controller gain is also supposed to be unknown besides the 
unknown parameter in the parametric part, and we only consider the noise-free case. For this 
model, according to some a priori knowledge on the non-parametric part and the unknown 
controller gain, we design another type of adaptive controller based on a gradient-like 
adaptation law with time-varying deadzone so as to deal with both kinds of uncertainties. 
And in this example we can establish the asymptotic convergence of tracking error under 
some mild conditions, althouth these conditions required are not as perfect as in the first 

example in sense that L < 0.5 is far away from the best possible bound — h V 2 . 

These two examples illustrate different methods of designing adaptive estimation and 
control algorithms. However, their essential ideas and principles are all based on the a 
priori knowledge on the system model, especially on the parametric part and the non- 
parametric part. From these examples, we can see that the closed-loop stability analysis is 
rather nontrivial. These examples demonstrate new adaptive control ideas to deal with two 
kinds of internal uncertainties simultaneously and illustrates our elementary theoretical 
attempts in establishing closed-loop stability. 

1. Introduction 

This chapter will focus on a special topic on adaptive estimation and control for systems with 
parametric and nonparametric uncertainties. Our discussion on this topic starts with a very 
brief introduction to adaptive control. 

1.1 Adaptive Control 

As stated in [SB89], " Research in adaptive control has a long and vigorous history" since 
the initial study in 1950s on adaptive control which was motivated by the problem of 
designing autopilots for air-craft operating at a wide range of speeds and altitudes. With 
decades of efforts, adaptive control has become a rigorous and mature discipline which 
mainly focuses on dealing parametric uncertainties in control systems, especially linear 
parametric systems. 

From the initial stage of adaptive control, this area has been aiming at study how to deal 
with large uncertainties in control systems. This goal of adaptive control essentially means 
that one adaptive control law cannot be a fixed controller with fixed structure and fixed 
parameters because any fixed controller usually can only deal with small uncertainties in 
control systems. The fact that most fixed controllers with certain structure (e.g. linear 
feedback control) designed for an exact system model (called nominal model) can also work 
for a small range of changes in the system parameter is often referred to as robustness, 
which is the kernel concept of another area, robust control. While robust control focuses on 
studying the stability margin of fixed controllers (mainly linear feedback controller), whose 



Adaptive Estimation and Control for Systems with Parametric and Nonparametric Uncertainties 17 



design essentially relies on priori knowledge on exact nominal system model and bounds 
of uncertain parameters, adaptive control generally does not need a priori information 
about the bounds on the uncertain or (slow) time- varying parameters. Briefly speaking, 
comparing with the approach of robust control to deal with parametric or nonparametric 
uncertainties, the approach of adaptive control can deal with relatively larger uncertainties 
and gain more flexibility to fit the unknown plant because adaptive control usually 
involves adaptive estimation algorithms which play role of "learning" in some sense. 
The advantages of adaptive control come from the fact that adaptive controllers can adapt 
themselves to modify the control law based on estimation of unknown parameters by 
recursive identification algorithms. Hence the area of adaptive control has close connections 
with system identification, which is an area aiming at providing and investigating 
mathematical tools and algorithms that build dynamical models from measured data. 
Typically, in system identification, a certain model structure is chosen by the user which 
contains unknown parameters and then some recursive algorithms are put forward based 
on the structural features of the model and statistical properties of the data or noise. The 
methods or algorithms developed in system identification are borrowed in adaptive control 
in order to estimate the unknown parameters in the closed loop. For convenience, the 
parameter estimation methods or algorithms adopted in adaptive control are often 
referred to as adaptive estimation methods. Adaptive estimation and system identification 
share many similar characteristics, for example, both of them originate and benefit from 
the development of statistics. One typical example is the frequently used least-squares (LS) 
algorithm, which gives parameter estimation by minimizing the sum of squared errors (or 
residuals), and we know that LS algorithm plays important role in many areas including 
statistics, system identification and adaptive control. We shall also remark that, in spite of 
the significant similarities and the same origin, adaptive estimation is different from 
system identification in sense that adaptive estimation serves for adaptive control and 
deals with dynamic data generated in the closed loop of adaptive controller, which means 
that statistical properties generally cannot be guaranteed or verified in the analysis of 
adaptive estimation. This unique feature of adaptive estimation and control brings many 
difficulties in mathematical analysis, and we will show such difficulties in later examples 
given in this paper. 

1.2 Linear Regression Model and Least Square Algorithm 

Major parts in existing study on regression analysis (a branch of statistics) [DS98, Ber04, 
Wik08j], time series analysis [BJR08, Tsa05], system identification [Lju98, VV07] and 
adaptive control [GS84, AW89, SB89, CG91, FL99] center on the following linear regression 
model 

z k =6 T <j) k +v k (l) 

where {z k } , (/) k , V k represent observation data, regression vector and noise disturbance (or 

external uncertainties), respectively. Here 6 is the unknown parameter to be estimated. 
Linear regression models have many applications in many disciplines of science and 
engineering [Wik08g, web08, DS98, Hel63, Wei05, MPV07, Fox97, BDB95]. For example, as 
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stated in [web08], Linear regression is probably the most widely used, and useful, statistical 
technique for solving environmental problems. Linear regression models are extremely powerful, and 
have the power to empirically tease out very complicated relationships between variables. Due to the 
importance of model (1.1), we list several simple examples for illustration: 

• Assume that a series of (stationary) data (x k , y k ) (k = 1, 2, ■ ■ ■ , N) are generated from the 
following model 

where /?0 , fil are unknown parameters, {x k } are i. i. d. taken from a certain probability 

distribution, and £ k « N(0, (J ) is random noise independent of X . For this model, let 6 

= IPO, fil ] T , <Pk = [l,xk] T , then we have y k = T <j) k + € k . This example is a classic 

topic in statistics to study the statistical properties of parameter estimates §n as the data size 

N grows to infinity. The statistical properties of interests may include E(# — 0\ Var(#) , 
and so on. 

• Unlike the above example, in this example we assume that X k and X k+l have close 
relationship modeled by 

X k + l=0O+0l X k +e k 

where /?o, /?i are unknown parameters, and £ k « A^(0, <J ) are i. i. d. random noise 

independent of {x\, x% ■ ■ ■ , Xk}. 

This model is an example of linear time series analysis, which aims to study asymptotic 

statistical properties of parameter estimates ^tv under certain assumptions on statistical 

properties of £ k . Note that for this example, it is possible to deduce an explicit expression 

ofxnntermsof £. (y = 0,l,---,£-l). 

• In this example, we consider a simple control system 

x k + i=Po+& x k+ bu k+ £ k 

where b ¥" is the controller gain, £ k is the noise disturbance at time step k. For this model, 

in case where b is known a priori, we can take; 6 — [/? >,A] / A ~[^?^-i] ' 

z k - x k ~ bu k _ x otherwise, we can take 6 = [j3 Q , fi x , b] T , <p k = [1, x k _ x ] T , z k =x k - bu k _ { . 
In both cases, the system can be rewritten as 



z k=0 T A+£ k 
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which implies that intuitively, can be estimated by using the identification algorithm since 
both data z\ and (f> k are available at time step k. Let 6 k denote the parameter estimates at 

time step 6 k , then we can design the control signal u k by regarding as the real parameter 



b 



00 — Pix k ] 



where { f k } is the known reference signal to be tracked, and b , /? , f3 x are estimates of b , 

j3 , /^ , respectively. Note that for this example, the closed-loop system will be very 

complex because the data generated in the closed loop essentially depend on all history 
signals. In the closed-loop system of an adaptive controller, generally it is difficult to 
analyze or verify statistical properties of signals, and this fact makes that adaptive 
estimation and control cannot directly employ techniques or results from system 
identification. Now we briefly introduce the frequently-used LS algorithm for model (1.1) 
due to its importance and wide applications [LH74, Gio85, Wik08e, Wik08f, Wik08d]. The 
idea of LS algorithm is simply to minimize the sum of squared errors, that is to say, 



ft 



LS : 



argmin Y, fa 

C fe=i 



C<t> k ] T [zk-C<t>k] 



(1.2) 



This idea has a long history rooted from great mathematician Carl Friedrich Gauss in 1795 
and published first by Legendre in 1805. In 1809, Gauss published this method in volume 
two of his classical work on celestial mechanics, heoria Motus Corporum Coelestium in 
sectionibus conicis solem ambientium[GdiuQ9\, and later in 1829, Gauss was able to state that the 
LS estimator is optimal in the sense that in a linear model where the errors have a mean of 
zero, are uncorrelated, and have equal variances, the best linear unbiased estimators of the 
coefficients is the least-squares estimators. This result is known as the Gauss-Markov 
theorem [Wik08a]. 

By Eq. (1.2), at every time step, we need to minimize the sum of squared errors, which 
requires much computation cost. To improve the computational efficiency, in practice we 
often use the recursive form of LS algorithm, often referred to as recursive LS algorithm, 
which will be derived in the following. First, introducing the following notations 



(1.3) 



"*r" 


, *n = 


"«" 


, v n = 


■ v\- 


2 T 




l<t>n\ 




[< \ 



and using Eq. (1.1), we obtain that 



z n = ® n o + v n . 
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Noting that 

n 

Y\ z k-CM T [zk-CM 
fc=i 
= [z n - *„c] T [z„ - *„c] 

= [c - (*;$ n )+$;z n ] T $^ n [c - (^$„) + $;z„] 
+3I[i-* n (*;* n ) + #;;]z n 

where the last equation is derived from properties of Moore-Penrose pseudoinverse 
[Wik08h] 



we know that the minimum of [Z n — ©„$"] [Z n — O w £"] can be achieved at 

^ s = (*-^„)+^^„ (1 . 4) 

which is the LS estimate of 6. Let 

and then, by Eq. (1.3), with the help of matrix inverse identity 

[A- 1 + A- l B T C~ l BA- 1 ]- 1 =A- B T {C + BA~ l B T )- l B 
we can obtain that 

= [A- l +A 1 B T C- 1 BA- l Y 1 

= p_, - (P n .A )[i + («-! )#, (^„ )]■' («-> ) 

= P ,-flP ,<M r P , 

where 

a n = (i + </.;p n _ 1 n )- 1 

Further, 
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On = Pn^2 ^ kZ l 



fc=l 



n-1 



k=l 

= &n-l — &nPn-l4>n<i> n Pn-lQn-l + Pn-l^n^n ~ ^nPn-l<f>n^nPn 
= /9 n _i - a n P n -X(j) n (l) T n P n -\d n -i + P n -i</) n [l - a n </>nPn-l<l>n\Zn 
= #n-l — 0, n P n -i(j) n ^ r n P n -iO n -i + a n P n -i(f) n Zn 

= 9 n -t + a n P n -i(j) n (z T n - <f) T n 9n-i) 
Thus, we can obtain the following recursive LS algorithm 

n = Qn-\ + anPn-l4>n(Zn ~ <f> n @n-l) 

where P n -i and d n -i reflect only information up to step n-1, while a n , and Z^ — ^ l n _ l 

reflect information up to step n. 

In statistics, besides linear parametric regression, there also exist generalized linear models 

[Wik08b] and non-parametric regression methods [Wik08i], such as kernel regression 

[Wik08c]. Interested readers can refer to the wiki pages mentioned above and the references 

therein. 



1.3 Uncertainties and Feedback Mechanism 

By the discussions above, we shall emphasize that, in a certain sense, linear regression 
models are kernel of classical (discrete-time) adaptive control theory, which focuses to cope 
with the parametric uncertainties in linear plants. In recent years, parametric uncertainties 
in nonlinear plants have also gained much attention in the literature [MT95, Bos95, Guo97, 
ASL98, GHZ99, LQF03]. Reviewing the development of adaptive control, we find that 
parametric uncertainties were of primary interests in the study of adaptive control, no 
matter whether the considered plants are linear or nonlinear. Nonparametric uncertainties 
were seldom studied or addressed in the literature of adaptive control until some new areas 
on understanding limitations and capability of feedback control emerged in recent years. 
Here we mainly introduce the work initiated by Guo, who also motivated the authors' 
exploration in the direction which will be discussed in later parts. 

Guo r s work started from trying to understand fundamental relationship between the 
uncertainties and the feedback control. Unlike traditional adaptive theory, which focuses on 
investigating closed-loop stability of certain types of adaptive controllers, Guo began to 
think over a general set of adaptive controllers, called feedback mechanism, i.e., all possible 
feedback control laws. Here the feedback control laws need not be restricted in a certain 
class of controllers, and any series of mappings from the space of history data to the space of 
control signals is regarded as a feedback control law. With this concept in mind, since the 
most fundamental concept in automatic control, feedback, aims to reduce the effects of the 
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plant uncertainty on the desired control performance, by introducing the set F of internal 
uncertainties in the plant and the whole feedback mechanism LI, we wonder the following 
basic problems: 

1. Given an uncertainty set F, does there exist any feedback control law in LI which can 
stabilize the plant? This question leads to the problem of how to characterize the maximum 
capability of feedback mechanism. 

2. If the uncertainty set F is too large, is it possible that any feedback control law in LI cannot 
stabilize the plant? This question leads to the problem of how to characterize the limitations 
of feedback mechanism. 

The philosophical thoughts to these problems result in fruitful study [Guo97, XGOO, ZG02, 
XG01, LX06, Ma08a, Ma08b]. 

The first step towards this direction was made in [Guo97], where Guo attempted to answer 
the following question for a nontrivial example of discrete-time nonlinear polynomial plant 
model with parametric uncertainty: What is the largest nonlinearity that can be dealt with 
by feedback? More specifically, in [Guo97], for the following nonlinear uncertain system 



y t+1 = 9<f> t + u t -\- w t +u <h = 0(y\), b > 



(1.5) 



where 9 is the unknown parameter, b characterizes the nonlinear growth rate of the 
system, and { w t } is the Gaussian noise sequence, a critical stability result is found — system 

(1.5) is not a.s. globally stabilizable if and only if b > 4. This result indicates that there exist 
limitations of the feedback mechanism in controlling the discrete-time nonlinear adaptive 
systems, which is not seen in the corresponding continuous-time nonlinear systems (see 
[Guo97, Kan94]). The "impossibility" result has been extended to some classes of uncertain 
nonlinear systems with unknown vector parameters in [XG99, Ma08a] and a similar result 
for system (1.5) with bounded noise is obtained in [LX06]. 

Stimulated by the pioneering work in [Guo97], a series of efforts ([XGOO, ZG02, XG01, 
MG05]) have been made to explore the maximum capability and limitations of feedback 
mechanism. Among these work, a breakthrough for non-parametric uncertain systems was 
made by Xie and Guo in [XGOO], where a class of first-order discrete-time dynamical control 
systems 

2fc+i = f(Vt) +u t -\- w t +u /(') e F(L) (1.6) 

is studied and another interesting critical stability phenomenon is proved by using new 
techniques which are totally different from those in [Guo97]. More specifically, in [XGOO], 
F(L) is a class of nonlinear functions satisfying Lipschitz condition, hence the Lipschitz 
constant L can characterize the size of the uncertainty set F(L). Xie and Guo obtained the 

following results: if £ > _ + Aj. > then there exists a feedback control law such that for any 

2 
/ <5 F(L), the corresponding closed-loop control system is globally stable; and if 

L < — h V 2 , then for any feedback control law and any y G R , there always exists 
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some f E F(L) such that the corresponding closed-loop system is unstable. So for system 

(1.6), the "magic" number _ + J~2 characterizes the capability and limits of the whole 

2 
feedback mechanism. The impossibility part of the above results has been generalized to 
similar high-order discrete-time nonlinear systems with single Lipschitz constant [ZG02] 
and multiple Lipschitz constants [Ma08a]. From the work mentioned above, we can see two 
different threads: one is focused on parametric nonlinear systems and the other one is 
focused on non-parametric nonlinear systems. By examining the techniques in these threads, 
we find that different difficulties exist in the two threads, different controllers are designed 
to deal with the uncertainties and completely different methods are used to explore the 
capability and limitations of the feedback mechanism. 

1.4 Motivation of Our Work 

From the above introduction, we know that only parametric uncertainties were considered 
in traditional adaptive control and non-parametric uncertainties were only addressed in 
recent study on the whole feedback mechanism. This motivates us to explore the following 
problems: When both parametric and non-parametric uncertainties are present in the 
system, what is the maximum capability of feedback mechanism in dealing with these 
uncertainties? And how to design feedback control laws to deal with both kinds of internal 
uncertainties? Obviously, in most practical systems, there exist parametric uncertainties 
(unknown model parameters) as well as non-parametric uncertainties (e.g. unmodeled 
dynamics). Hence, it is valuable to explore answers to these fundamental yet novel 
problems. Noting that parametric uncertainties and non-parametric uncertainties essentially 
have different nature and require completely different techniques to deal with, generally it 
is difficult to deal with them in the same loop. Therefore, adaptive estimation and control in 
systems with parametric and non-parametric uncertainties is a new challenging direction. In 
this chapter, as a preliminary study, we shall discuss some basic ideas and principles of 
adaptive estimation in systems with both parametric and non-parametric uncertainties; as to 
the most difficult adaptive control problem in systems with both parametric and non- 
parametric uncertainties, we shall discuss two concrete examples involving both kinds of 
uncertainties, which will illustrate some proposed ideas of adaptive estimation and special 
techniques to overcome the difficulties in the analysis closed-loop system. Because of 
significant difficulties in this new direction, it is not possible to give systematic and 
comprehensive discussions here for this topic, however, our study may shed light on the 
aforementioned problems, which deserve further investigation. 

The remainder of this chapter is organized as follows. In Section 2, a simple semi-parametric 
model with parametric part and non-parametric part will be introduced first and then we 
will discuss some basic ideas and principles of adaptive estimation for this model. Later in 
Section 3 and Section 4, we will apply the proposed ideas of adaptive estimation and 
investigate two concrete examples of discrete-time adaptive control: in the first example, a 
discrete-time first-order nonlinear semi-parametric model with bounded external noise 
disturbance is discussed with an adaptive controller based on information-contraction 
estimator, and we give rigorous proof of closed-loop stability in case where the uncertain 
parametric part is of linear growth rate, and our results reveal again the magic number 
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— I- ^2 ; in the second example, another noise-free semi-parametric model with 
2 

parametric uncertainties and non-parametric uncertainties is discussed, where a new 
adaptive controller based on a novel type of update law with deadzone will be adopted to 
stabilize the system, which provides yet another view point for the adaptive estimation and 
control problem for the semi-parametric model. Finally, we give some concluding remarks 
in Section 5. 



2. Semi-parametric Adaptive Estimation: Principles and Examples 

2.1 One Semi-parametric System Model 

Consider the following semi-parametric model 

z k=0 T 0k+f(A) + £k (2- 1 ) 

where 6 €=& denotes unknown parameter vector, /( *)€E F denotes unknown function and 
S k E A^ denote external noise disturbance. Here ©, F and Ak represent a priori knowledge 

on possible 6 , f(jj> k ) and G k , respectively. In this model, let 

then Eq. (2.1) becomes Eq. (1.1). Because each term of right hand side of Eq. (2.1) involves 
uncertainty, it is difficult to estimate 6 , f{ffi k ) and S k simultaneously. 
Adaptive estimation problem can be formulated as follows: Given a priori knowledge on 6, 
f( ) and 8 k , how to estimate 6 and/( ) according to a series of data {(j) k ,z k \k — 1,2, • • • , n } 
Or in other words, given a priori knowledge on 6 and vu, how to estimate 6 and Vk according 
to a series of data {^ , z^ ; A: = 1,2,- •• ,n }. 

Now we list some examples of a priori knowledge to show various forms of adaptive 
estimation problem. 

Example 2.1 As to the unknown parameter 0, here are some commonly-seen examples of a priori 
knowledge: 

• There is no any a priori knowledge on except for its dimension. This means that 6 can be 
arbitrary and we do not know its upper bound or lower bound. 

• The upper and lower bounds of 6 are known, i.e. Q<Q<0 , where and 6 are constant vector 
and the relationship "<" means element-wise "less or equal" . 

• The distance between 6 and a nominal do is bounded by a known constant, i.e. \ \ 6 - do \ \ < re, 
where re > is a known constant and do is the center of set 0. 

• The unknown parameter lies in a known countable or finite set of values, that is to say, 6 €={ 6i, 62, 

03, •'*). 

Example 2.2 As to the unknown function f( ), here are some possible examples of a priori knowledge: 

• f(x) = Ofor all x. This case means that there is no unmodeled dynamics. 
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• Function f is bounded by other known functions, that is to say, f{x) < f{x) < f{x) for any x. 

• The distance between f and a nominal fo is bounded by a known constant, i.e. \ \f ~ fo\ I ^ r/ , 
where r/ > is a known constant andfo can be regarded as the center of a ball F in a metric functional 
space with norm \\ ■ | | . 

• The unknown function lies in a known countable or finite set of functions, that is to say, /€E {fi,f% 

• Function f is Lipschitz, i.e. f(x { ) - f(x 2 ) < L | x 1 - x 2 \ for some constant L > 0. 

• Function f is monotone (increasing or decreasing) with respect to its arguments. 

• Function f is convex (or concave). 

• Function f is even (or odd). 

Example 2.3 As to the unknown noise term £ k , here are some possible examples of a priori 
knowledge: 

• Sequence £ k = 0. This case means that no noise/ disturbance exists. 

• Sequence £ k is bounded in a known range, that is to say, £ < £ k < £ for any k. One special case 
is £ = -£ . 

i , ! 

• Sequence £ k is bounded by a diminishing sequence, e.g, \£ k |< — for any K . This case means 

k 
that the noise disturbance converges to zero with a certain rate. Other typical rate sequences include 

{— }, {S k } (0<S<l),andsoon. 

k 

• Sequence £ k is bounded by other known sequences, that is to say, —k — e & — ^ for any k. 

This case generalizes the above 
cases. 

• Sequence £ k is in a known finite set of values, that is to say, £ k G {e 1 , e 2 , * * * , € N } . This case 
may happen in digital systems where all signals can only take values in a finite set. 

• Sequence £ k is oscillatory with specific patterns, e.g. £ k > if k is even and £ k < ifk is odd. 

• Sequence £ k has some statistical properties, for example, Ee k = , Ee k = <J ;; for another 
example, sequence { £ k } is i.i.d. taken from a probability distribution e.g. £ k « [7(0,1) . 

Parameter estimation problems (without non-parametric part) involving statistical 
properties of noise disturbance are studied extensively in statistics, system identification 
and traditional adaptive control. However, we shall remark that other non-statistic 
descriptions on a priori knowledge is more useful in practice yet seldom addressed in 
existing literature. In fact, in practical problems, usually the probability distribution of the 
noise/ disturbance (if any) is not known and many cases cannot be described by any 
probability distribution since noise/ disturbance in practical systems may come from many 
different types of sources. Without any a priori knowledge in mind, one frequently-used way 
to handle the noise is to simply assume the noise is Gaussian white noise, which is 
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reasonable in a certain sense. But in practice, from the point of view of engineering, we can 
usually conclude the noise/ disturbance is bounded in a certain range. This chapter will 
focus on uncertainties with non-statistical a priori knowledge. Without loss of generality, in 

this section we often regard V k = f{jj) k ) + 8 k as a whole part, and correspondingly, a priori 
knowledge on V k , (e.g. V k < V k < Vk ), should be provided for the study. 

2.2 An Example Problem 

Now we take a simple example to show that it may not be appropriate to apply traditional 
identification algorithms blindly so as to get the estimate of unknown parameter. 
Consider the following system 

Z k =00 k +f(0 k ,k) + £ k (2.2) 

where 0, f( ) and S k are unknown parameter, unknown function and unmeasurable noise, 

respectively. For this model, suppose that we have the following a priori knowledge on the 
system: 

• No a priori knowledge on 6 is known. 

• At any step k, the term f(4>k, &) is of form/(^fc>&) = exp(£ fc </> fc ) Here {&} is an 
unknown sequence satisfying < <?fc < 1. 

• Noise 8 k is diminishing with I efc I — fc. 

And in this example, our problem is how to use the data generated from model (2.2) so as to 
get a good estimate of true value of parameter 6. In our experiment, the data is generated by 
the following settings (k = 1, 2, ■ ■ -, 50): 

k 1 

& = 5 >A= — > f(A> k ) = ex P(l sin ^ I A) > £ k = y( a k -°- 5 ) 
10 k 

where {(% k } are i.i.d. taken from uniform distribution 11(0, 1). Here we have N = 50 groups 

of data (4>k>Zk). 

Since model (2.2) involves various uncertainties, we rewrite it into the following form of 

linear regression 

Zk = 0<t>k + Vk (2.3) 

by letting 

v* =/(&,*) + **■ 

From the a priori knowledge for model (2.2), we can obtain the following a priori knowledge 
for the term v\ 
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v k <v k < v k 



where 



Vk 



'^k 



exp(<£fc) + i 




1 + * , 

exp(<?!> fe ) + £ 


if fc >0 
if 0fc < 



Since model (2.3) has the form of linear regression, we can use try traditional identification 
algorithms to estimate 6. Fig. 1 illustrates the parameter estimates for this problem by using 
standard LS algorithm, which clearly show that LS algorithm cannot give good parameter 
estimate in this example because the final parameter estimation error 
§ k= 0-0n 5 .68284 is very large. 

True=5,LS=10.6828 s Error=5.68284 



5 10 15 20 25 30 35 40 45 50 

Fig. 1. The dotted line illustrates the parameter estimates obtained by standard least-squares 
algorithm. The straight line denotes the true parameter. 



One may then argue that why LS algorithm fails here is just because the term V k is in fact 

biased and we indeed do not utilize the a priori knowledge on v\. Therefore, we may try a 
modified LS algorithm for this problem: let 



^(v k 


+ 


v k ) 


1,- 

2 {Vk 


— 


Vk) 


v k - 


Cfe 




Zk — 


Ch 
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Ck = 

d k = 

w k = 
Vk = 

then we can conclude that y k = 6 (/) k + W k and W k E [—d k ,d k ~\ , where [— d k ,d k ~\is a 

symmetric interval for every k. Then, intuitively, we can apply LS algorithm to data 

{ (jp k , Z k ) , k = 1, 2, • • • ,N}. The curve of parameter estimates obtained by this modified LS 

algorithm is plotted in Fig. 2. Since the modified LS algorithm has removed the bias in the a 
priori knowledge, one may expect the modified LS algorithm may give better parameter 
estimates, which can be verified from Fig. 2 since the final parameter estimation error 

N = N —#«— 1.83314. In this example, although the modified LS algorithm can 

work better than the standard LS algorithm, the modified LS algorithm in fact does not help 
much in solving our problem since the estimation error is still very large comparing with the 
true value of the unknown parameter. 

True=5,LS2=3.16686,Error=-1 .83314 
1 2 i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ' i i 
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6 '- 

5 ' : 

4 - 
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Fig. 2. The dotted line illustrates the parameter estimates obtained by modified least-squares 
algorithm. The straight line denotes the true parameter. 
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From this example, we do not aim to conclude that traditional identification algorithms 
developed in linear regression are not good, however, we want to emphasize the following 
particular point: Although traditional identification algorithms (such as LS algorithm) are very 
powerful and useful in practice, generally it is not wise to apply them blindly when the matching 
conditions, which guarantee the convergence of those algorithms, cannot be verified or asserted a 
priori. This particular point is in fact one main reason why the so-called minimum-variance 
self tuning regulator, developed in the area of adaptive control based on the LS algorithm, 
attracted several leading scholars to analyze its closed-loop stability throughout past 
decades from the early stage of adaptive control. 

To solve this example and many similar examples with a priori knowledge, we will propose 
new ideas to estimate the parametric uncertainties and the non-parametric uncertainties. 

2.3 Information-Concentration Estimator 

We have seen that there exist various forms of a priori knowledge on system model. With the 
a priori knowledge, how can we estimate the parametric part and the non-parametric part? 
Now we introduce the so-called information-concentration estimator. The basic idea of this 
estimator is, the a priori knowledge at each time step can be regarded as some constraints of 
the unknown parameter or function, hence the growing data can provide more and more 
information (constraints) on the true parameter or function, which enable us to reduce the 
uncertainties step by step. We explain this general idea by the simple model 

z k = 9 T (f) k +v k (2.4) 

with a priori knowledge that e cz R d ,u k e V k • Then, at k-th step (k >1), with the 
current data k, (j) k ,z k we can define the so-called information set h at step k: 

I k = {0£G:z k -O T <f> k €V k }. (2 .5) 

For convenience, let Jo = ©• Then we can define the so-called concentrated information set Ck at 
step k as follows 

.7=0 (2.6) 

which can be recursively written as 

C k = C fc -i Dl k (2.7) 

with initial set Co = 0. Eq. (2.7) with Eq. (2.5) is called information-concentration estimator 
(short for IC estimator) throughout this chapter, and any value in the set C k can be taken as 

one possible estimate of unknown parameter 6 at time step k . The IC estimator differs 
from existing parameter identification in the sense that the IC estimator is in fact a set- 
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valued estimator rather than a real-valued estimator. In practical applications, generally 

C k is a domain in R , and naturally we can take the center point of C k as k . 

Remark 2.1 The definition of information set varies with system model. In general cases, it can be 

extended to the set of possible instances of (and/ or f ) which do not contradict with the data at 

step k. We will see an example involving unknown fin next section. 

From the definition of the IC estimator, the following proposition can be obtained without 

difficulty: 

Proposition 2.1 Information-concentration estimator has the following properties: 

(i) Monotonicity: C Z) C x 3 C 2 3 • • • 



(ii) Convergence: Sequence {Ck} has a limit set C^ = O C k , 

k=\ 



(Hi) If the system model and the a priori knowledge are correct, then Coo must be a non-empty set 
with property 6 G Coo and any element ofCoo can match the data and the model; 



(iv) If C^ = , then the data {(f) k , Z k } cannot be generated by the system model used by the IC 
estimator under the specified a priori knowledge. 

Proposition 2.1 tells us the following particular points of the IC estimator: property (i) 
implies that the IC estimator will provide more and more exact estimation; property (ii) 
means that the there exists a limitation in the accuracy of estimation; property (iii) means 
that true parameter lies in every C k if the system model and a priori knowledge are correct; 

and property (iv) means that the IC estimator provides also a method to validate the system 
model and the a priori knowledge. Now we discuss the IC estimator for model (2.4) in more 
details. In the following discussions, we only consider a typical a priori knowledge on 

V_ k ^V k < Vk are two known sequences of vectors (or scalar s). 

2.3.1 Scalar case: d= 1 

By Eq. (2.5), we have 

h = {o e : v k < z k - e<t> k < v k } 

Solving the inequality in h, we obtain that 

04> k G [z k — v_ k ,z k — v k ] 
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and consequently, if (j) k ^ , then we have 



where 



G [&*,&*] 



t>k 



min(sign(<^fc)(^ fc — v k ),sign(<f) k )(z k — v k )) 

\4>*\ 

max(sign(0fc)(^ fc — v k ),sign((p k )(z k — v k )) 



Here sign(x) denotes the sign of x: sign(x) = 1, 0,-1 for positive number, zero, and negative 
number, respectively. Then, by Eq. (2.7), we can explicitly obtain that 

c k = W k J k ) 

where tLk and Pk can be recursively obtained by 

P k = max(^ fc _ 1 ,6 A .) 
k = min(^ fc _i,6 fc ) 



P 4 




Fig. 3. The straight line may intersect the polygon V and split it into two sub-polygons, one 
of which will become new polygon V . The polygon V can be efficiently calculated from the 
polygon V. 
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2.3.2 Vector case: d > 1 

In case of d > 1, since 6 and (j) k are vectors, we cannot directly obtain explicit solution of 
inequality 

0^(f) k G [z k -V kJ z k -v k ] (2.8) 

Notice that Eq. (2.8) can be rewritten into two separate inequalities: 

$l'0<Zk- v k , {-(j>kY • < ~{zk - v k ) 

we need only study linear equalities of the form (/> < C . Generally speaking, the solution 
to a system of inequalities represents a polyhedral (or polygonal) domain in R d , hence we 
need only determine the vertices of the polyhedral (or polygonal) domain. In case of d = 2, it 

is easy to graph linear equalities since every inequality (j) < C represents a half -plane. In 
general case, let V k = \0 i , / = 1,2, • • • , p k ) denote the distinct vertices of the domain C k 
and p k denote the number of vertices of domain C k , then we discuss how to deduce V k 
from V k _ x . The domain C k has two more linear constraints than the domain C k _ x 



with 



«J 


.$< 


Cjfej,j = 1,2 


<Kl = <^, 




Cfc,l =Zk-Vk 


h,2 = -hi 




Cfe,2 = -(^Ife - %) 



We need only add these two constraints one by one, that is to say, 

V , k _ 1 = AddLinearContraint(Vfc_i, <^fc,i, Cfci) 
Vfc = AddLinearContraint(V^_ 1 ,^A;,2,c/ C) 2) 

where AddLinearConstraint(]/, %c){ s an algorithm whose function is to add linear constraint 
(ft 6 < C to the polygon represented by vertex set V and to return the vertex set of the new 
polygon with added constraint. 

Now we discuss how to implement the algorithm AddLinearConstraint. 

2D Case: In case of d = 2, (f) < C represents a straight line which splits the plane into two 

half -planes (see Fig. 3). In this case, we can use an efficient algorithm 
AddLinearConstraint2D which is listed in Algorithm 1. Its basic idea is to simply test each 
vertex of V to see whether to keep original vertex or generate new vertex. The time 
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complexity of Algorithm 1 is 0(s), where s is the number of vertices of domain V. Note that 

it is possible that V = if the straight line L : (ft 6 < C does not intersect with the polygon 

V and any vertex P t of polygon V does not satisfy (j) P f > C . And the vertex number of 

polygon V y can in fact vary within the range from to s according to the geometric 
relationship between the straight line L and the polygon V. 

Algorithm 1 AddLinearConstraint2D(V, (j>, c): Add linear constraint (j) T 9 < c (<^> £ U 2 ) to a polygon V 
Input: V: represented by clock- wise arranged vertices Pi, ?%, • ■ • , P s 
Output: V: clock-wise arranged vertices 

Letfl,«-P f 

2: Let Q be an empty queue 
3: for % = 1 to s do 
4: UtSi*-fPi-C 

5: end for 

6: Let #o <- $ s 
7: for i = 1 to s do 

if (^ = 0) or (<5i < and ^_i < 0) then 
Append vertex Pi to queue Q 

else if (Si > and Si-\ < 0) or (^ < and ^_i > 0) then 



Letw< 



Generate new vertex P 1 <- w - Pi + (1 - w) • P{-\ 

Append vertex P' to queue Q 
end if 
end for 
Return vertices in queue Q as the vertices of new domain V 



High-dimensional Case: In case of d > 2, (j) 9 < C represents a hyperplane which splits 
the whole space into two half-hyperplanes. 

Unlike in case of d = 2, the vertices in this case generally cannot be arranged in a certain 
natural order (such as clock-wise order). In this case, we can use an algorithm 
AddLinearConstraintND which is listed in Algorithm 2. The idea of this algorithm is to 
classify the vertices of V first according to their relationship with the hyperplane determined 

by hyperplane (j) < C . 

Algorithm 2 AddLinearConstraintND (V, ", c): Add linear constraint (j) < C (" % Rd) to a 
polyhedron V 

2.3.3 Implementation issues 

In the IC estimator, the key problem is to calculate the information set h or the concentrated 
information set Q at every step. From the discussions above, we can see that it is easy to 
solve this basic problem in case of d = 1. However, in case of d > 1, generally the vertex 
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number of domain C k may grow as k — > oo . Therefore, it may be impractical to 
implement the IC estimator in case of d > 1 since it may require growing memory as 
k — > oc To overcome this problem, noticing the fact that the domain Q will shrink 
gradually as k — > oo in order to get a feasible IC estimate of the unknown parameter 
vector, generally we need not use too many vertices to represent the exact concentrated 
information set Q. That is to say, in practical implementation of IC estimator in high- 
dimensional case, we can use a domain Q with only a small number (say up to M) of 
vertices to approximate the exact concentrated information set Q. With such an idea of 
approximate IC estimator, the issue of computational complexity will not hinder the 
applications of IC estimator. 

We consider two typical cases of approximate IC estimator. One typical case is that Ck ~2. Ok 
for any k, and the other case is that &k 2 Cu for any k. Let C^ = PiQ , then in the 

k=\ 

former case (called loose IC estimator, see Fig. 4), we must have 

oo 

^oo — I I ^k Q ^oo 
fe=l 

which means that we will never mistakenly exclude the true parameter from the 
concentrated approximate information sets; while in the latter case (called tight IC estimator, 
see Fig. 5), we must have 



Coo — I J Cfc 2 C 



k=l 

which means that the true parameter may be outside of C^ however any value in C^ can 
be served as good estimate of true parameter. 



02 

Fig. 4. Idea of loose IC estimator: The polygon P1P2P3P4P5 can be approximated by a triangle 
QiP 4 Q 2 .HereM = 3. 
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ft 




P 4 ft 

Fig. 5. Idea of tight IC estimator: The polygon P1P2P3P4P5 can be approximated by a triangle 
P3P4P5. Here M = 3. 

Now we discuss implementation details of tight IC estimator and loose IC estimator. Without 
loss of generality, we only explain the ideas in case of d = 2. Similar ideas can be applied in 
cases of d > 2 without difficulty. 

Tight IC estimator: To implement a tight IC estimator, one simple approach is to modify 
Algorithm 1 so as it just keeps up to M vertices in the queue Q. To get good approximation, 
in the loop of Algorithm 1, it is suggested to abandon the generated vertex P ' (in Line 12 of 
Algorithm 1) which is very close to existing vertex P ; (let ; = i if 5; < and 6,--i > or j = i - 1 
if dj> and 5;-i < 0). The closeness between P' and existing vertex P ; can be measured by 
checking the corresponding weight W . 

Loose IC estimator: To implement a loose IC estimator, one simple approach is to modify 
Algorithm 1 so as it can generate M vertices which surround all vertices in the queue Q. To 
this end, in the loop of Algorithm 1, if the generated vertex P ' (in Line 12 of Algorithm 1) is 
very close to existing vertex P, (let j = i if 5; < and 8;-i > or j = i - 1 if 5; > and 5/-i < 0), 
we can simply append vertex P ; instead of P' to queue Q. In this way, we can avoid 
increasing the vertex number by generating new vertices. The closeness between P' and 
existing vertex P ; can be measured by checking the corresponding weight w. 
Besides the ideas of tight or loose IC estimator, to reduce the complexity of IC estimator, we 
can also use other flexible approaches. For example, to avoid growth in the vertex number of 
Vk asfc — > 00, we can approximate V^by using a simple outline rectangle (see Fig. 6) every 
certain steps. For a polygon Vk with vertices Pi, P2, ' ' ' , P s , we can easily obtain its outline 
rectangle by algorithm FindPolygonBounds listed in Algorithm 3. Here for convenience, the 
operators max and min for vectors are defined element-wisely, i.e. 



max(A, B) = [max(Ai,Bi),max(A2,B2), • ■ • ,max(i4rf, S^)]' 3 
mm(A,B) = [mm(A u B 1 ),mm(A 2 ,B 2 )r " ,mm(A d ,B d )] T 

where ^ = K*b M> ' * ' )&d\ 5 B = [Bi, f^, ' ' * 5 Bd\ are two vectors in R n . 
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Algorithm 3 FindPolygonBoiinds(V): Find the outline rectangle of polygon V 



Input: V: represented by vertices Pi, P2, * * * , P s 

Output: V: the outline rectangle represented by upper and lower bounds of each component of vertices 
l 



Let B ^ Pi 

Let B^ Pi 
for i = 2 to 5 do 

LetB^max(B,Pi) 
LetB^min(B,P,) 
end for 

Return vertices of rectangle represented by coordinate bounds B and B as the vertices of new domain V' 




Fig. 6. Idea of outline rectangle: The polygon P l P 2 P 3 P 4 P 5 can be approximated by an 

outline rectangle. In this case, 5 1? 5i denote the lower bound and upper bound in the x- 

axis (1st component of each vertex), and i? 2 , B i denote the lower bound and upper bound 
in the y-axis (2nd component of each vertex) 



2.4 IC Estimator vs. LS Estimator 

2.4.1 Illustration of IC Estimator 

Now we go back to the example problem discussed before. For this example, (j) k and Zk are 
scalars, hence we need only apply the IC estimator introduced in Section 2.3.1. Since IC 
estimator yields concentrated information set C k at every step, we can take any value in 
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C k as parameter estimate of true parameter. In this example, C k is an interval at every 

step step. For comparison with other parameter estimation methods, we simply take 

1 - 
6 k = —(bk + b k ) , i.e. the center of interval C k , as the parameter estimate at step k. 

In Fig. 7, we plot three curves bk , b k and 6 k . From this figure, we can see that, for this 

particular example, with the help of a priori knowledge, the upper estimates bk and lower 
estimates b k given by the IC estimator converge to true parameter 6 = 5 quickly, and 

consequently 6 k also converges to true parameter. 



5 10 15 20 25 30 35 40 45 50 

Fig. 7. This figure illustrates the parameter estimates obtained by the proposed information- 
concentration estimator. The upper curve and lower curve represent the upper bounds bk 
and lower bounds b k for the parameter estimates. We use the center curve 

k = — \p k + b k J to yield the parameter estimates. 



We should also remark that the parameter estimates given by the IC estimator are not 
necessarily convergent as in this example. Whether the IC parameter estimates converge 
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largely depend on the accuracy of a priori knowledge and the richness of the practical data. 
Note that the IC estimator generally does not require classical richness concepts (like 
persistent excitation) which are useful in the analysis of traditional recursive identification 
algorithms. 

2.4.2 Advantages of IC Estimator 

We have seen practical effects of IC estimator for the simple example given above. Why can 
it perform better than the LS estimator? Roughly speaking, comparing with traditional 
identification algorithm like LS algorithm, the proposed IC estimator has the following 
advantages: 

1. It can make full use of a priori information and posterior information. And in the ideal 
case, no information is wasted in the iteration process of the IC estimator. This property is 
not seen in traditional identification algorithms since only partial information and certain 
stochastic a priori knowledge can be utilized in those algorithms. 

2. It does not give single parameter estimate at every step; instead, it gives a (finite or 
infinite) set of parameter estimates at every step. This property is also unique since 
traditional identification algorithms always give parameter estimates directly. 

3. It can gradually find out all (or most) possible values of true parameters; and this 
property can even help people to check the consistence between the practical data and the 
system model with a priori knowledge. This property distinguishes traditional identification 
algorithms in sense that traditional identification algorithms generally have no mechanism 
to validate the correctness of the system model. 

4. The a priori knowledge can vary from case to case, not necessarily described in the 
language of probability theory or statistics. This property enables the IC estimator to handle 
various kinds of non-statistic a priori knowledge, which cannot be dealt with by traditional 
identification algorithms. 

5. It has great flexibilities in its implementation, and its design is largely determined by the 
characteristics of a priori knowledge. The IC estimator has only one basic principle - information 
concentration! Any practical implementation approach using such a principle can be 
regarded as an IC estimator. We have discussed some implementation details for a certain 
type of IC estimator in last subsection, which have shown by examples how to design the IC 
estimator according the known a priori knowledge and how to reduce computational 
complexity in practical implementation. 

6. Its accuracy will never degrade as time goes by. Generally speaking, the more steps 
calculated, the more data involved, and the more accurate the estimates are. Generally 
speaking, traditional identification algorithms can only have similar property (called strong 
consistency) under certain matching conditions. 

7. The IC estimator can not only provide reasonably good parameter estimates but also tell 
people how accurate these estimates are. In our previous example, when we use 

k — — \b k + b k las the parameter estimate, we know also that the absolute parameter 
estimation error \§ I - \§ _ Q I will not exceed — (A + £ I. In some sense, such a property 

| III r\ \ K K_) 

may be conceptually similar to the so-called confidence level in statistics. 
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2.4.3 Disadvantages of IC Estimator 

Although the IC estimator has many advantages over traditional identification algorithms, it 
may have the following disadvantages: 

1. The proposed IC estimator is relatively difficult to incorporate stochastic a priori 
knowledge on noise term, especially unbounded random noise. In fact, in such cases 
without non-parametric uncertainties, traditional identification algorithms like LS algorithm 
may be more suitable and efficient to estimate the unknown parameter. 

2. The efficiency of IC estimator largely depends on its implementation via the 
characteristics of the a priori knowledge. Generally speaking, the IC estimator may involve a 
little more computation operations than recursive identification algorithms like LS 
algorithm. We shall remark also that this point is not always true since the numerical 
operations involved in the IC estimator are relatively simple (see algorithms listed before), 
while many traditional identification algorithms may involve costly numerical operations 
like matrix product, matrix inversion, etc. 

3. Although the IC estimator has simple and elegant properties such as monotonicity and 
convergence, due to its nature of set-valued estimator, no explicit and recursive expressions can 
be given directly for the IC parameter estimates, which may bring mathematical difficulties 
in the applications of the IC estimator. However, generally speaking, we also know that 
closed-loop analysis for adaptive control using traditional identification algorithms is not 
easy, too. 

Summarizing the above, we can conclude that the IC estimator provides a new approach or 
principle to estimate parametric and even non-parametric uncertainties, and we have shown 
that it is possible to design efficient IC estimator according to characteristics of a priori 
knowledge. 

3. Semi-parametric Adaptive Control: Example 1 

In this section, we will give a first example of semi-parametric adaptive control, whose 
design is essentially based on the IC estimator introduced in last section. 

3.1 Problem Formulation 

Consider the following system 



J/t+1 = Q<f>t + f(Vt) +Ut + W t +i 



(3.1) 



where y t/ u t and w t are the output, input and noise, respectively; f{^)E.F(L) is an 

unknown function (the set F(L) will be defined later) and is an unknown parameter. To 
make further study, the following assumptions are used throughout this section: 
Assumption 3.1 The unknown function f '. R — > R belongs to the following uncertainty set 

HL) = {f:\f(x)-f(y)\<L\x-y\ + c} (3 . 2) 
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where c is an arbitrary non-negative constant. 
Assumption 3.2 The noise sequence {w t } is bounded, i.e. 

\w t \ <W 

where w is an arbitrary positive constant. 

Assumption 3.3 The tracking signal {y t } is bounded, i.e. 

\y* t \<S,Vt>0 (3 . 3) 

where S is a positive constant. 

Assumption 3.4 In the parametric part 0(/) t , we have no any a priori information of the unknown 

parameter 6, but (j) t = g(y t ) is measurable and satisfies 

M /< | g(^a) | <M 

— I x\-xl " - (34) 

for any X x ^ X 2 , where M <M are two positive constants and b>lis a constant. 
Remark 3.1 Assumption 3.4 implies that function g( ■) has linear growth rate when b = 1. Especially 
when g(x) = x, we can take M = M' = 1. Condition (3.4) need only hold for sufficiently large x\ and 
xi, however we require it holds for all xl ¥> x2 to simplify the proof. We shall also remark that Sokolov 
[Sok03] has ever studied the adaptive estimation and control problem for a special case of model (3.1), 

where <ft t is simply taken as ay t . 

Remark 3.2 Assumption 3.4 excludes the case where g( ■) is a bounded function, which can be 

handled easily by previous research. In fact, in that case w\ +l = 6(/> t + W t+l must be bounded, 

hence by the result of[XG00], system (3.1) is stabilizable if and only if L < — h -Jl • 

2 

3.2 Adaptive Controller Design 

In the sequel, we shall construct a unified adaptive controller for both cases of b =1 and b >1. 
For convenience, we introduce some notations which are used in later parts. Let I = [a, b] be 

A 1 

an interval, then Tn(I^) = — (<2 + Z>) (a+ b) denotes the center point of interval I, and 

{r\ Al \u I 
r[l )= — \b — a\ denotes the radius of interval I. And correspondingly, we let 

/(x, 8) = [X — S, X + 8\ denote a closed interval centered atiei? with radius 5 > 0. 

Estimate of Parametric Part: At time t, we can use the following information: yo, yi, ' ' ' , yt, 

wo, ui, ■ ■ ■ , u t -i and ^ , ^ 2 •> " ' •> $t • Define 
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A 
Z 3 ~ Vj+1 ~ U 3 (3.5) 

and 

T = n T( z 3~ Zi L \yi-Vi\ i _2w±c_\ 

1 " ,. !' T ±K h-4>i' \<t>j-<f>i\ ^ \h-4n\t 

( % ,3)£Jt (3.6) 



where 



then, we can take 



J t = {{iJ)eM:i<j<t,(j) z ^^} 



(3.7) 



t = m(I t ), S t = r(I t ) (3<8 ) 

as the estimate of parameter 6 at time t and corresponding estimate error bound, 
respectively. With 0± and 5 t defined above, 9t — t + 8 t and 0_ t =0 t —3 t are the 
estimates of the upper and lower bounds of the unknown parameter 6 , respectively. 
According to Eq. (3.6), obviously we can see that {Ot} is a non-increasing sequence and 
{0 t } is non-decreasing. 

Remark 3.3 Note that Eq. (3.6) makes use of a priori information on nonlinear function /( ■). This 
estimator is another example of the IC estimator which demonstrates how to design the IC estimator 
according to the Lipschitz property of function /(■). With similar ideas, the IC estimator can be 
designed based on other forms of a priori information of function f( ■). 

Estimate of Non-parametric Part: Since the non-parametric part f(y t ) may be unbounded 

and the parametric part is also unknown, generally speaking it is not easy to estimate the 
non-parametric part directly. To resolve this problem, we choose to estimate 

9t = 9<t> t + f(y t ) 

as a whole part rather than to estimate f(yi) directly. In this way, consequently, we can 
obtain the estimate of f(y t ) by removing the estimate of parametric part from the estimate of 



Define 



then, we get 



it =argmm|y t -y%\ 

i<t (3.9) 
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9t =9t~ z it + z it 

= [Ht + f(yt)] - [OK + f(y it ) + w it+1 ] + z it 

= [9((j) t - (j> it ) + z it ] + [f(y t ) - f(y it ) - w it+1 ] (310) 

Thus, intuitively, we can take 

9t = t {<f)t ~<t>u) + z h = h {4>t -4Ht) + (Vu+i - Ui t ) (3.11) 

as the estimate of g t at time t . 
Design of Control u t : Let 

b t = maxy^ = max(6 t -i,2/t) 

i<t 



h = niiny^ = min^.^yt) 



Under Assumptions 3.1-3 A, we can design the following control law 

7 , f -gt + vUi ti\yt-yu\ < D 

'""1 -9t + Hbt + k t ) if\y t -y it \>D 



(3.12) 



(313) 



where D is an appropriately large constant, which will be addressed in the proof later. 
Remark 3.4 The controller designed above is different from most traditional adaptive controllers in 
its special form, information utilization and computational complexity. To reduce its computational 
complexity, the interval I t given by Eq. (3.6) can be calculated recursively based on the idea in Eq. 
(3.12). 

3.3 Stability of Closed-loop System 

In this section, we shall investigate the closed-loop stability of system (3.1) using the 
adaptive controller given above. We only discuss the case that the parametric part is of 
linear growth rate, i.e. b = 1. For the case where the parametric part is of nonlinear growth 
rate, i.e. b > 1, though simulations show that the constructed adaptive controller can stabilize 
the system under some conditions, we have not rigorously established corresponding 
theoretical results; further investigation is needed in the future to yield deeper 
understanding. 

3.3.1 Main Results 

The adaptive controller constructed in last section has the following property: 

Theorem 3.1 When b = 1, < — h a/2 , the controller defined by Eqs. (3.5)- (3.13) can 

M' 2 
guarantee that the output fyt} of the closed-loop system is bounded. More precisely, we have 
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lim sup \yt ~ Vt\ < 2w + c. 

t^oo (314) 

Based on Theorem 3.1, we can classify the capability and limitations of feedback mechanism 
for the system (3.1) in case of b = 1 as follows: 

Corollary 3.1 For the system (3.1) with both parametric and non-parametric uncertainties, the 
following results can be obtained in case ofb = 1: 

(i) If b = 1 < — |- ^/2 , then there exists a feedback control law guaranteeing that the closed- 

M' 2 
loop system is stabilized. 

(ii) When (j) t = y t (i.e. g{x) = X ), the presence of uncertain parametric part 6(j) t does not reduce 

the critical value — h V 2 of the feedback mechanism which is determined by the uncertainties of 
non-parametric part. 

V roof of Corollary 3.1: (i) This result follows from Theorem 3.1 directly, (ii) When g(x) = x, we 
can take M = M' = 1. In this case, the sufficiency can be immediately obtained via Theorem 
3.1; on the other hand, the necessity can be obtained by the "impossibility" part of Theorem 

1 in [XGOO]. In fact, if Z > — h V2 , for any given control law {u t }, we need only take the 

parameter 6 = 0, then by [XGOO, Theorem 2.1], there exists a function /such that system (3.1) 
cannot be stabilized by the given control law. 

Remark 3.5 As we have mentioned in the introduction part, system (1.6), a special case of system 
(3.1), has been studied in [XGOO]. Comparing system (3.1) and system (1.6), we can see that system 
(3.1) has also parametric uncertainty besides nonparametric uncertainty and noise disturbance. 
Hence intuitively speaking, it will be more difficult for the feedback mechanism to deal with 
uncertainties in system (3.1) than those in system (1.6). Noting that M'< M, we know this fact has 
been partially verified by Theorem 3.1. And Corollary 3.1 (ii) indicates that in the special case of 

(j) t — y t , since the structure of parametric part is completely determined, the uncertainty in non- 
parametric part becomes the main difficulty in designing controller, and the parametric uncertainty 
has no influence on the capability of the feedback mechanism, that is to say, the feedback mechanism 

can still deal with the non-parametric uncertainty characterized by the set F(L) with L < — h V 2 . 

Remark 3.6 Theorem 3.1 is also consistent with classic results on adaptive control for linear systems. 
In fact, when L = 0, the non-parametric part f(y t ) vanishes, consequently system (3.1) becomes a 
linear-in-parameter system 

y t+i =00 t +U t +W t+i (3.15) 
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where is the unknown parameter, and (j) t — g(y t ) can have arbitrary linear growth rate because 

by Theorem 3.1, we can see that no restrictions are imposed on the values of M and M x when L = 
0. Based on the knowledge from existing adaptive control theory [CG91], system (3.15) can be always 
stabilized by algorithms such as minimum-variance adaptive controller no matter how large the 9 is. 
Thus the special case of Theorem 3.1 reveals again the well-known result in a new way, where the 
adaptive controller is defined by Eq. (3.13) together with Eqs. (3.5) - (3.12). 

ML 3 r- 

Corollary 3.2 Ifb= 1, < — h V2, C = W = , then the adaptive controller defined by Eqs. 

(3.5) - (3.13) can asymptotically stabilize the corresponding noise-free system, i.e. 

lim \y t -Vt\ = 0. 
t^oo (3.16) 

3.3.2 Preliminary Lemmas 

To prove Theorem 3.1, we need the following Lemmas: 

Lemma 3.1 Assume {x n } is a bounded sequence of real numbers, then we must have 

lim min \x n — x%\ = 0. 

n— >oo i<n (3.17) 

Proof: It is a direct conclusion of [XG00, Lemma 3.4]. It can be proved by argument of 
contradiction. 

Lemma 3.2 Assume that Ze(0, — \-yJ2),d>0,rir ) >0 • If non-negative sequence {h n , n > 0} 

satisfies 

( 1 U \ + 

hn+i < I L max hi — — > hi + d I , Vn > 

\ i<n 2 ^ — J I 

\ ~ i=Q / 



tiq 



(3.18) 



A 

where X = max(x,0), Vx G R , then we must have 



lim > hi < oo. 



i=Q (3.19) 



Proof: See [XG00, Lemma 3.3]. 



3.3.3 Proof of Theorem 3.1 

Proof of Theorem 3.1: We divide the proof into four steps. In Step 1, we deduce the basic 
relation between y t +i and @t, and then a key inequality describing the upper bound of 

| y t — y i I is established in Step 2. Consequently, in Step 3, we prove that | y t — y i |— > 
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as t — > oc if y t is not bounded, and hence the boundedness of output sequence {y t } can be 
guaranteed. Finally, in the last step, the bound of tracking error can be further estimated 
based on the stability result obtained in Step 3. 
Step 1: Let 

Ot =6- § t 

4£ A 

Vt+l = && + f(Vt) + W t +i - 9t (3i 20) 

then, by definition of ut and Eq. (3.13), obviously we get 



_ yf+i + vUi ft\yt-Vh\<D 

y* +1 + Ubt+k t ) if \y t -y H \>D 



§ 

Now we discuss y t+l . By Eq. (3.11) and Eq. (3.1), we get 



(3.21) 



yf+\ = Ht + f(yt) + w+i - 9t 

= $4> t + f(y t ) + w t+ i - e t {<t> t - 4> it ) 

-(0<t> it +f(y it )+w it+1 ) (3.22) 

= {o - d t )(<t> t - K) 

+[f(vt) - f(yit)] + K+i - w u+i) 
= 0t(4>t - <t>i t ) + [f(yt) - f(vi t )] + ( w t+i - wu+i) 



In case of (f) t = <f) i , i.e. yt = yu , obviously we get 

\Vt+i\ = \m+i -u>u+i\ < 2 ^; 



(3.23) 



(3.24) 



otherwise, we get 

yf+i = {0 t + D tM ){(JH - 0ij + K+i - w it+ i) 

where 

D A f(yj) - f(yj) 

1,3 ~ fa - <t> 3 ' 

Obviously D. = D .. . In the latter case, i.e. when (j) t ^ ^ , for any (/, f) G J t , noting that 
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Zj -Zi = (y j+ i - Uj) - {y i+1 - Ui) 

+K+i - m+i] 



(3.25) 



we obtain that 



e _ z 3 - Zj =D _ Wj+i - Wi+1 

<f>3 ~ & <t>3 ~ & (3.26) 

Therefore 

9, + Dtt, = («-^)-(4-^) + Aa 

where 

</>j <?i (328) 

ML 3 n- A MI 3 r- 

Step 2: Since < — h V 2 , there exists a constant £ > (J such that h £ < — h V 2 . 

M ? 2 M' 2 



(3.27) 



Let 



Bt^lk^Afi^Bt-Bt-! 



(3.29) 
and consequently 

l-^tl = &£ - &tJ l^^l = 1^1 ~ l-^t-ll (3.30) 

By the definitions of b t , b t and i?^ , we obtain that 

id I / \Bt\ ifife+i€B t 

1 t+l1 UN + l2/*+i - s& + **)l if y^ i B * (3.3D 

By the definition of i t , obviously we get 
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, _ , f =\ABt\ ifytgBt-t 

lift ^hI|<| AjB .| tf yt e AB . c Bt _ 1 ^^ 

Step 3: Based on Assumption 3.4, for any fixed £ > , we can take constants D andD ' such 

that \ d — (h . \> D > when | y t — y. \> D . Now we are ready to show that for 

j £ - , 

any s > 0, fere always exists t> s such that \ y t — y f \> D . 

In fact, suppose that it is not true, then there must exist s > such that | y t — y i |> D for 

any t > s, correspondingly \(j) t — (f) it > D'. Consequently, by the definition of D, for 
sufficiently large t and ; < t, we obtain that 



,Wj +1 -w i+1 2w. 1 
1 fc - <t>u ~ Df 4M 



together with the definition of t , we know that for any s <i<j < t, 
hence for S < j <t,i = i ■ , we get 



(3.33) 



(3.34) 



i* ,m ia /m L 2w + c L 1 

I ^Ul I W WU M ,T jy ~ M ^ iM (335) 

Now we consider ZX . — Z) . . . 

i ,i t j ,ij 

Let d n = D n . , then, by the definition of Dy, noting that | j; . -J. |>| J/ . -J. |> D for 
any ; > s, we obtain that 

.£. i = | /(ife)-/(yj) . . I gg - yj I < _^ 

*' J Vi"Vj ^i-^j " M>1 (3.36) 

so we can conclude that {d n , n > s} is bounded. Then, by Lemma 3.1, we conclude that 

lim min \d t - dA — 0. 

t^oos<j<t (337) 



48 Adaptive Control 



Consequently there exists s' > s such that for any t > s ', we can always find a corresponding 
j=j(t) satisfying 

\Dt ,• — Djj.\ = \dt — dA < e. 

Summarizing the above, for any t > s ', by taking; = j(t), we get 

ft + D ttit \ = |Aa - A,j - ^^ + A^j(t)| 

<|A i4t _i? w | + |a^tti| + |A^ J (t)| 
^+jfe« + (£ + dA (339) 



Therefore 



„# 



yf+il = m + D t , it )^(y t -y it ) + (w t+1 -w it+1 ) 
< L e M\y t -y it \ + 2w. 



Since | y t - yu \ > D, we know that 



(3.40) 



Vt+i = yf+i + g ( fe * + fc)- (3 41) 

From Eq. (3.39) together with the result in Step 2, we obtain that 

l-Bel < |-B*+i| < max{|B f |^|B £ | + |y f+ i-Jfe + 6»)|} 

= ma X {|S i |,i|B i | + |j / f +1 |} (342) 

Thus noting (3.40), we obtain the following key inequality: 

|ABt| < {L e M\y t -y it \ + 2w-±\B t \)+ (343) 

where 

L € M = (^ + ^e)M = M£ + f e < f + y/2. 



(3.44) 



Considering the arbitrariness of t > s', together with Lemma 3.2, we obtain that 
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£|ABil<oo, 

*>*' (3.45) 

and consequently {\B t \] must be bounded. By applying Lemma 3.1 again, we conclude 
that 

\y t -y it \ <min|j/t-J/t| ^0 

Kt (3.46) 

which contradicts the former assumption! 

Step 4: According to the results in Step 3, for any s > 0, there always exists t > s such that 

| y t — y t |< D . Then, we can easily obtain that { | u t \ } is bounded, say | t |< L . 
Considering that 

yf+i = Ufa - K) + [/(y*) - f(%)\ + Ki - «>h+i) (3.47) 

we can conclude that 

Iw+il < \vt+i + Vt+i\ 

< L f \(f> t -<j) it \ + {L\y t -y it \+c) + 2w 

- Y (3.48) 

where Y = VMD + LD + c + 2w + S. 

The proof below is similar to that in [XG00] . Let 

k = in) \{t :\y t \< Y}, t n = inf {t :\y t \< Y}. 

t>0 t>t n -i (3.49) 

Because of the result obtained above, we conclude that for any n>l,t n is well-defined and t n 
< oo. Let V n = y t , then obviously {v n j is bounded. Then, by applying Lemma 3.1, we get 

mink-^|->0 

%<n (3.50) 

as fl — > oc . Thus for any 6* > , there exists an integer no such that for any n > no, 

min|t; n -t^| < e. 

t<n (3.51) 



So 



\Vt n ~ Vit n I = ™ n |jfe« - J/*l < ™ n |lfc« " Vti I < £ ' 

z<i n z<n (352) 
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By taking £ sufficiently small, we obtain that 

|tfe n+ i | < UMs + Le + c + 2w + S <Y (3 53) 

for any n > no. 

Thus based on definition of t n , we conclude that t n +i = t n + V. Therefore for any t > t > 

\Vt\<Y (3.54) 

which means that the sequence /yj is bounded. 

Finally, by applying Lemma 3.1 again, for sufficiently large t, \y t ~ y i |< £ consequently 

\Vt+i - y*t+i I = b* 1 1 < L'Me + Le + c + 2w. (3 55) 

Because of arbitrariness of 8 , Theorem 3.1 is true. 

3.4 Simulation Study 

In this section, two simulation examples will be given to illustrate the effects of the adaptive 
controller designed above. In both simulations, the tracking signal is taken as 

y* = 10 sin — and the noise sequence is i.i.d. randomly taken from uniform distribution 

10 
11(0, 1). The simulation results for two examples are depicted in Figure 8 and Figure 9, 
respectively. In each figure, the output sequence Vt and the reference sequence Vt are 

A 

plotted in the top-left subfigure; the tracking error sequence e t =y t — y t is plotted in the 

bottom-left subfigure; the control sequence U t is plotted in the top-right subfigure; and the 

parameter 6 together with its upper and lower estimated bounds is plotted in the bottom- 
right subfigure. 
Simulation Example 1: This example is for case of b = 1, and the unknown plant is 

yt+i = f{yt) + Og{y t ) + w t+ i, /(•) e T{L) (3 56) 

with L = 2.9<- + 42,g(x) = x (i.e. b = \,M =M'=1) 

and 

f(x) = 1.4c sin log(|&| + 1). (357) 

For this example, we can verify that 
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\f{x)\ = 1.4| sinlog(|a;| + 1) ± ^ cos ^ l+1) | < 2.8 < L 

consequently | f(x) - f(y) \<L\x-y\, i.e. /(•) e F(L) .. 

Simulation Example 2: This example is for case of b > 1, and the unknown plant is 

with Z = 2.9 , g{x) = x 2 (i.e. b = 2 , M = M = 1 ), and 

f(x) — 2x + sinrr 2 . 



(3.58) 



(3.59) 



(3.60) 



For this example, we can verify that | f(x) - f(y) \< L \ x - y \ +2 , i.e. f (•) g F (L) . 
From the simulation results, we can see that in both examples, the adaptive controller can 
track the reference signal successfully. The simulation study verified our theoretical result 
and indicate that under some conditions, the adaptive control law constructed in this paper 
can deal with both parametric and non-parametric uncertainties, even in some cases when 
the parametric part is of nonlinear growth rate. In case of b = 1, the stabilizability criteria 
have been completely characterized by a simple algebraic condition; however, in case of b > 
1, it is very difficult to give complete theoretical characterization. Note that usually more 
accurate estimate of parameter can be obtained in case of b > 1 than in case of b = 1, 
however, worse transient performance may be encountered. 



output: y and y* 






20 
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parameter and bounds estimation 



20 40 60 80 100 20 

Fig. 8. Simulation example 1: (g(x) = x, b = 1,M = M' = 1) 
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output: y and y* 



control: u 






parameter and bounds estimation 



100 




100 



Fig. 9. Simulation example 2: (g(x) = x% b = 2,M = M'=l) 



4. Semi-parametric Adaptive Control: Example 2 

In this section, we shall give another example of adaptive estimation and control for a semi- 
parametric model. Although the system considered in this section is similar to the model 
considered in last section, there are several particular points in this example: 

• The controller gain in this model is also unknown with a priori knowledge on its sign and 
its lower bound. 

• The system is noise-free, and correspondingly the asymptotic tracking is rigorously 
established in this example. 

• The algorithm in this example has a form of gradient algorithm, however, it partially 
makes use of a priori knowledge on the non-parametric part. 

• Due to the limitation of this algorithm and technical difficulties, unlike the algorithm in 
last section, we can only establish stability of the closed-loop system under condition 

< L < 0.5 for the parametric part, which is much stronger than the condition 

0<Z<-+V2 

2 



This example is given here only for the purpose of demonstrating that there exist other 
possible ways to make use of a priori knowledge on the parametric uncertainties and non- 
parametric uncertainties. By comparing the examples in this section and last section, the 
readers may get a deeper understanding to adaptive estimation and control problems for 
semi-parametric models. 



Adaptive Estimation and Control for Systems with Parametric and Nonparametric Uncertainties 53 

4.1 Problem Formulation 

We consider the following system model 

y k+1 = 0$(y(k)) + f(y k ) + bu k (41) 

where y k G R and U k G R are output and control signals, respectively. Here 6 G R is 

the unknown parameter, b G R is the unknown controller gain, O(-) is a known function, 

and /( ) is the unknown function. We have the following a priori knowledge on the real 
system: 

Assumption 4.1 The nonparametric uncertain function /( ■) is Lipschitz, i.e., 
1 1 f( x \ ) _ f( x 2 ) 1 1- ^ 1 1 x i ~~ x 2 1 1' ^ x i 5 x 2 E ^ ' w/zere L < 0.5. The known function O(-) zs also a 
Lipschitz function with Lipschitz constant L. 

Assumption 4.2 The sign of unknown controller gain b is known. Without loss of generality, we 
assume that b>b>0 where b is a known constant. 

Assumption 4.3 The reference signal y k is a known bounded deterministic signal. 

The control objective is to design the control law U k such that the output signal yu 

asymptotically tracks a bounded reference trajectory y k and all the closed-loop signals are 
guaranteed to be bounded. 

4.2 Adaptive Control Design 

To design the adaptive controller, the following notations will be used throughtout this 
section: 

l k = argminly/e - y t \ 
l<k-l 

e k = y k - y% (42 ) 

Obviously, at time step k, with the history information fyy, j < k} and the a priori knowledge, 
the index l k and the tracking error e k dire available. Later we will see important roles of 

l k and e k in the controller design. 

Estimation of parametric part: The estimates of the parameter 9 and the controller gain b at 

time step k are denoted by &k and ^fc, respectively. We design the following adaptive 
update law to update the parameter estimates recursively: 
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a a a k e k [^ (yk-i) ~^(vi k _ 1 )] 
k = 0jfe_i+7 



n 



D k -1 

if 61 > b 



k X1 u k 

b otherwise 



■ia k e k 

D k -! 



b' k = b k -! +Proj & {^- -[u fc _i -u tk _ 1 }} 

1 „ , , 



Dk-i = -\\®{y k -i) -Hvi k -M + 2K-1 -ui k _,y 

where < y < 1 and the coefficient CL k is defined by a time-varying deadzone: 



{ \\yk-i-yi k _ x \ .ni i ^ m i 

1 F^ if |c fc | > Alyfc-x-tft^J 
otherwise 



(4.4) 



Estimation of non-parametric part: As in last section, we do not estimate the non- 
parametric part directly. Instead, we try to estimate the parametric part and non- 
parametric part as a whole part 



(4.5) 



(4.6) 



y* = 9$(y k ) + f(y k ) 

Noticing of the system model (4.1), we know that 

Vh+i = y* + bu k 

consequently, from Eqs. (4.5) and (4.6), it is easy to derive 

y* = vt-vi + vt 

= «y*)-*(w h )] + /(y*)-/(wj ( 4 - 7 ) 

+yi k +i - bu h 

Since function /( •) is unknown and parameters and b are unknown, we simply estimate 
y k by the following eqution 



y* = dk[®(yk) - ®(vi k )] + yi k +i - hu lk 



(4.8) 



where @k and &k are regarded as true parameters, and the unknown term 
f(y k )~ f(ji t ) in Eq- ( 47 ) is sim P l Y dipped off. 
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Adaptive control law: By Eq. (4.6), according to the certainty equivalence principle, we can 
design the following adaptive control law 

Uk = -J-(V* -Vk+i) 

Ok (4.9) 

Where b k and y k are given by Eqs. (4.3) and (4.7). The closed-loop stability will be given 
later. 

4.3 Asymptotic Tracking Performance 

4.3.1 Main Results 

Theorem 4.1 In the closed-loop system (4.1) with control law (4.9) and parameters adaptation law 
(4.3), under Assumptions 4.1 -4.3, all the signals in the closed-loop system are bounded and further 

the tracking error e k will converge to zero. 



4.3.2 Preliminaries 

Definition 4.1 Let X k and y k (k > ) be two discrete-time scalar or vector signals. 

• We denote X k —0[y k ], if there exist positive constants m\ and mi such that ||r. \\<m l 
max . <k || y . \\ +m 2 > V& > k and ko is the initial time step. 

• We denote X k = o\_y k ] , if there exists a sequence CC k satisfying lim^^ (X k -^ such that 
|| x k ||< m x max 7 .<, || y i || +w 2 , Vk>k Q . 

• We denote X k - y k if they satisfy X k = 0[y k ] and y k = 0[x k ] . 

Lemma 4.1 Consider the following parameter update law 

e k = IK ifK>» 

y fi otherwise /a -i q\ 

e f k = § k -i+Proj § (7] k ) ( 411 ) 



mm = { ;»• 



ifOk-i = iiandrik < 
otherwise (avt\ 
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where 6 G R is an unknown scalar, 6 k is its estimate at time step k , }iis the lower bound of 6, 
and T] k G R is any sequence. Then, k > JU is guaranteed and the following properties hold: 

9% > §1, Projj( m ) = rjt, Oj(PrvJe(rik) - m) < 

where k = k ~ 9 and k = k ~ . 

Proof: According to Eqs. (4.10) and (4.11), it is obvious that 9 k > JU always hold. From Eq. 

(4.12), we see that | Prqj^T^) |=| TJ k | , hence Proj 2 - (rj k ) = rf k ■ Further, we have 

4-i(Proj^(r/ fc )-r/ fc ) 

f (4-i - 0)(-7] k - 7] k ) < 0, if 4-1 = fi means (4-i - 9) < and rj k < 
\ (4-i - 0)(Vk - rj) = otherwise 

From (4.10), we see that 6 k =6 k it 6 k > JU such that k = 6 k when 6 k > JU . Noticing 
that when k < // , we have JU < 9 , so that 

> (p-0) 2 = k -fff = §l (413) 

Therefore, we always have k > k . This completes the proof. 

Lemma 4.2 Given a bounded sequence X k €E R . Define 

l k = arg min \\X k - X t \\ 

l<k — l 
Then, we have 

lim \\X k -X lk \\=0 

k^oo 
Proof: This lemma is an extension of Lemma 3.1. Its proof can be found in [Ma06]. 

Lemma 4.3 (Key Technical Lemma)Let {s t } be a sequence of real numbers and \cr t ) be a sequence 
of vectors such that 
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Iktll <ci + c 2 max \s k \ 

0<k<t 
Assume that 

S 2 

z t = — > as t — > oo 



where CC X > 0,(X 2 > . Then \\c t \\is bounded. 



Proof: This lemma can be found in [AW89, GS84]. 



4.3.3 Proof of Theorem 4.1 

Define parameter estimate errors ®k = ^k ~ and "k = "k ~ ". From Eqs. (4.7) and (4.8), 



we have 



V k = Vk~ vl 



= 6 k \§{y k ) - <5>(y h )} - f(y k ) + f(y h ) - b k u h (4 14) 



Then, we can derive the following error dynamics: 

= yf + ^fcWfc - &fc«fc - yl+i 

~# i (415) 

= -y£ - b k u k 

= -0 k [&(Vk) - *(vzj] + f(Vk) - f(vi h ) - h[u k - u lh ] 

According to Assumption 4.1, we have 

\f(vk) - f(yi k )\ < Mvk - yi k \ (4 .i6) 

where X can be any constant satisfying L < A < 0.5. 
From the error dynamics Eq. (4.15), we have 

e k = -4-i[*(y*-i)-^(w h -i)] + /(yfc-i)-/(lftjb-i) 

-h-i[u k -i - u h _ x ] (A17) 
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Choose Lyapunov function candidate as 

V k = 9 k + b k ( 4 18 ) 

From the adaptation laws (4.3), we obtain that 

%-%-i = 7 2 |^[^-i)-^,_ 1 )] 2 + 2 7 f^[$(y fe _ 1 )-$( ?/ife _ 1 )] 

u k-l u *-l (4.19) 



~2 ~2 ^ ~/2 -2 

9k ~ 9k-i < 9k ~ 9k- 



1 (4.20) 



= 7 2 #-K-i-^_ 1 ] 2 + 2 7 ^K_ 1 -^_ 1 ] (4.21) 

U k-\ D k-l 

Together with the error dynamics Eq. (4.17), we can derive that the difference of Vjt 

Uk-l (4.22) 

Noting that < ( < 1 and taking summation on both hand sides of Eq. (4.22), we obtain 

00 2 2 

5> 7 (l-7)-^<V(0)-V(oc) 
^o Dk ~ l 

Which implies 

a 2 e 2 
lim ** h = 
fc-oo n k _ 1 (4.23) 

and the boundedness of @h and &k. Considering Vk ^ €>k, we have 

\yk-\\ < max{|e j |} + C 2 , k > k 

where and C2 are some constants. From the definition of deadzone in Eq. (4.4), we have 
|efc| - \\yk-i - Vl h -i\ < a>k\ek\. 

Therefore, we have 
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llfc-il < max{|e J -|} + C 2 

= m &H\ej\ - Mvj-i - yij-i I + Mvj-i - yij-i I) + c 2 (424) 

< max{flu|e,-|} + A max {\yj - y t |} + C 2 , k > k 

j<k j<k — l 

Therefore, we have 

max {\yj\} < max{aJej|} + 2A max {\yj\} + C3, k > k® 

j<k-l j<k j<k-l (4,25) 

Note that X < 0.5, we have 



1 C 3 

/^k-i^ Vj ^ ~ 1 - 2X X f<k^ aj ^ ej ^ + 1 -2A ' fc > fc ° 



(4.26) 



holds for all X < X*, where C3 is some finite number. Note that inequality Eq. (4.26) means 
that y k _ x = 0\a k e k ] . Further we have 

£>!_! < Mvk-i) - ${yi k ^)\ + \u k -i - ui^ I - 0[j/ fe _i] = 0[a k e k ] 

Therefore, we can apply the Key Technical Lemma (Lemma 4.3) to Eq. (4.23) and obtain that 

lim ahCh = 

k^oo (4.27) 

which guarantees the boundedness of yu from Eq. (4.26) and thus, the boundedness of 
output yic tracking error e\. Therefore, applying Lemma 4.2 yields 

££>-Wj=0 (4.28) 

Next, we will show that lim^^ a k e k —> leads to lim^^ e k —> . From the 
definition of deadzone in Eq. (4.4), we have CL k G [0,1) . Let us define the following sets: 

Z+ = {k\a k = Q J keZ+} 

Z+ = {k\a k ^ 0, k G Z + } (429) 

which results in Z x nZ 2 = and Z x KJ Z 2 = Z . The following three cases need to 
be considered. In every case, we only need to discuss the case where k belongs to an infinite 
set. 
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Case i). Z x is an infinite set and Z 2 is a finite set. Let us discuss k G Z x . From the 
definition in Eq. (4.29), it follows that a\ = 0. Hence it is clear from the definition of deadzone 
(4.4) that <| e k |< X \ y k _ x - y x | which means lim^^ e k -> according to Eq. (4.28). 

Case ii). Z x is a finite set and Z 2 is an infinite set. Let us discuss k G Z 2 . From the 
definition in (4.29), it follows that a\ ^ 0. Hence it is clear from deadzone (4.4) that 
I e k 1= <*k\ e k\ + A\ y k _ x ~y h _ x | which means lim^^ e k =0 due to Eqs. (4.27) and (4.28). 

Case iii). Z x and Z 2 are infinite sets. If k G Z x then au = 0. Following Case i) gives 
littl^^ e k = . Otherwise, a k # 0, it follows from Case ii) that lim^^ e k = . 
Based on the discussions for the above three cases, we can conclude that lim^^ Cl k e k =0 
implies that lim^^^ e k = . This completes the proof. 

5. Conclusion 

In this chapter, we have formulated and discussed the adaptive estimation and control 
problems for a class of semi-parametric models with both parametric uncertainty and non- 
parametric uncertainty. For a typical semi-parametric system model, we have discussed new 
ideas and principles in how to estimate the unknown parameters and non-parametric part 
by making full use of a priori knowledge, and for a typical type of a priori knowledge on the 
non-parametric part, we have proposed novel information-concentration estimator so as to 
deal with both kinds of uncertainties in the system, and some implementation issues in 
various cases have been discussed with applicable algorithm descriptions. Furthermore, we 
have applied the ideas of adaptive estimation for semi-parametric model into two examples 
of adaptive control problem for two typical semi-parametric control systems, and discussed 
in details how to establish the closed-loop stability of the whole system with semi- 
parametric adaptive estimator and controller. Our discussions have demonstrated that the 
topic in this chapter is very challenging yet important due to its wide 

background. Especially, for the closed-loop analysis problem of semi-parametric adaptive 
control, the examples given in this chapter illustrate different methods to overcome the 
difficulties. 

In the first example of semi-parametric adaptive control, we have investigated a simple first- 
order nonlinear system with both non-parametric uncertainties and parametric 
uncertainties, which is largely motivated by the recent-year exploration of the capability and 
limitations of the feedback mechanism. For this model, based on the principle of the 
proposed IC estimator, we have constructed a unified adaptive controller which can be used 
in both cases of b = 1 and b > 1. When the parametric part is of linear growth rate (b = 1), we 
have proved the closed-loop stability under some assumptions and a simple algebraic 

ML 3 p- 

condition < h V 2 , which reveals essential connections with the known magic 

M' 2 
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number L = h y2 discovered in recent work [XGOO] on the study of feedback 

2 

mechanism capability. 

In the second example of semi-parametric adaptive control, we further assume that the 
control gain is also unknown, yet the system is noise-free, and we have designed an 
adaptive controller based on gradient-like estimation algorithm with time-varying deadzone 
according to the a priori knowledge on the non-parametric part and the unknown controller 
gain. In this example, although we cannot establish perfect results revealing the magic 

number — h V2 as in the first example, we can still establish good results of asymptotic 

tracking performance under some mild conditions. This example has demonstrated yet 
another method to deal with uncertainties in semi-parametric model. 

Finally, we shall remark that the discussed topic in this chapter is still in its infant stage, and 
many more chanlenging problems can be investigated in the future. These problems may 
root in wide practical background where the system model is only partially known a priori, 
that is to say, the major part of the system can be parameterized and the other part is 
unknown and non-parameterized with only limited a priori knowledge. Solving such 
problems can definitely improve our understanding to the whole feedback mechanism and 
help us gain more insights on the capability of adaptive control, especially non-traditional 
adaptive control methods which were not extensively addresed and studied in previous 
study. Therefore, we expect more theoretical study in this new topic, i.e. semi-parametric 
adaptive estimation and control. 
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1. Introduction 

The problems of the output regulations and/ or disturbance reductions have attracted a lot 
of interest and have been actively researched in the consideration of the control problem for 
systems which are required to have servomechanism and for vibration attenuation in 
mechanical systems. It is well known that such problems are solvable using the Internal 
Model Principle in cases where the system to be controlled and the exosystem which 
generates the output reference signal and external disturbances are known. In the case 
where the controlled system is unknown and/ or the exosystem is unknown, adaptive 
control strategies have played active roles in solving such problems for systems with 
uncertainties. For known systems with unknown exosystems, solutions with adaptive 
internal models have been provided in (Feg & Palaniswami, 1991), (Nikiforov, 1996) and 
(Marino & Tomei, 2001). In (Marino & Tomei, 2001), an output regulation system with an 
adaptive internal model is proposed for known non-minimum phase systems with 
unknown exosystems. Adaptive regulation problems have also been presented for time 
varying systems and nonlinear systems (Marino & Tomei, 2000; Ding, 2001; Serrani et al., 
2001). Most of these methods, however, assumed that either the controlled system or the 
exosystem was known. Only few adaptive regulation methods for unknown systems with 
unknown exosystems have been provided (Nikiforov, 1997a; Nikiforov, 1997b). The method 
in (Nikiforov, 1997a) is an adaptive servo controller design based on the MRAC strategy, so 
that it was essentially assumed that the order of the controlled system was known. The 
method in (Nikiforov, 1997b) is one based on an adaptive backstepping strategy. In this 
method, it was necessary to design an adaptive observer that had to estimate all of the 
unknown system parameters depending on the order of the controlled system. Further, the 
controller design based on the backstepping strategy essentially depends on the order of the 
relative degree of the controlled system. As a result, the controller's structure was quite 
complex in both methods for higher order systems with higher order relative degrees. 

In this paper, the adaptive regulation problem for unknown controlled systems is dealt with 
and an adaptive output feedback controller with an adaptive internal model is proposed for 
single input/ single output linear minimum phase unknown systems with unknown 
exosystems. The proposed method is based on the adaptive output feedback control 
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utilizing the almost strictly positive real-ness (ASPR-ness) of the controlled system and the 
controller is designed based on an expanded backstepping strategy with a parallel 
feedforward compensator (PFC) (Mizumoto et al., 2005). It is shown that, under certain 
assumptions, without a priori knowledge of the order of the controlled system and without 
state variables, one can design an adaptive controller with a single step backstepping 
strategy even when the system to be controlled has an unknown order and a higher order 
relative degree. Using the proposed method, one can not attain perfect output regulation, 
however, the obtained controller structure is relatively simple even if the system has a 
higher order and a higher order relative degree. 

2. Problem Statement 

Consider the following single input/ single output LTI system. 

x(t) = Ax(t)+bu(t) + C d w(t) 

y(t) = c T x(t) + d T w(t), () 

where x = [x 1 ,---,x n ] T e R n is the state vector and u,y g R are the input and the output, 
respectively. Further w(t) e R m is an unknown vector disturbance. 

We assume that the disturbances and the reference signal which the output y is required to 
track are generated by the following unknown exosystem: 



w(t) = A d w(t) 

y m (t)=cXt), 



,T ,, P) 



where A d e R mxm [ s a stable matrix with all its eigenvalues on the imaginary axis. It is also 
assumed that the characteristic polynomial of Ad is expressed by 

det(\I - A d ) = X m + a m _ 1 X^ 1 + - • • + a a X + a . (3) 

The objective is to design an adaptive controller that has the output y(t) track the reference 
signal y m (t) generated by an unknown exosystem given in (2) for unknown systems with 
unknown disturbances generated by the unknown exosystem in (2) using only the output 
signal under the following assumptions. 

Assumption 1 The system (1) is minimum-phase. 

Assumption 2 The system (1) has a relative degree of r. 

Assumption 3 c T A r_1 b > , i.e. the high frequency gain of the system (1) is positive. 

Assumption 4 The output y(t) and the reference signal y m (t) are available for measurement. 
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3. System Representation 

From Assumption 2, since the system (1) has a relative degree of r, there exists a smooth 
nonsingular variable transformation: [z T ,i] T j = Ox such that the system (1) can be 
transformed into the form (Isidori, 1995): 



where 



i(t) = A z z(t) + b z u(t)- 



*l(t) + D d w(t) 



q(t) = Qnq(t)+ 



Zl (t)+F d w(t) 



y = [l / 0,...,0]z 1 (t) + d T w(t), 



A. 



I r -lxr-l 



a _a r-l. 



(4) 



and c z e R n_r is an appropriate constant vector. From assumption 1, Q is a stable matrix 
because r](t) = Q r) il(t) denotes the zero dynamics of system (1). 



3.1 Virtual controlled system 

We shall introduce the following (r-l)th order stable virtual filter 1/f (s) with a state space 

representation: 



Zf(t) = A Uf z f (t) + b U( u(t) 
u fl (t) = <z f (t), 



(5) 



where z f = |z f ,---,z f J T and 



A, = 



I r -2xr-2 
L-Po"-Pr-2. 

bu ,=h-,l],cl =[l,0,-,0]. 



With the following variable transformation using the filtered signal z f given in (5): 
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Virtual controlled system 

Fig. 1. Virtual controlled system with a virtual filter 



?l(t)=Zl(0 

? i (t)=-b z u fi _ i (t) + z i (t) + ^c ? .z i _ j (t), 



(6) 



where 



c ?i =9 i -a r _ i , (l<i<r-l) 

r-1 



j=l 



3i=P r 



ei=Pr-i-l+XPr-i +j -l C ?^ 

the system (1) can be transformed into the following virtual system which has u f given 
from a virtual input filter as the control input (Michino et aL, 2004) (see Fig.l): 

^ (t) = a z ?(t)+ c^ y (t) + b z u fi (t) + c d T w(t) 

%(t) = A rl , y (t) + c^ 1 (t) + C dn w(t) (7) 

y(t) = ? 1 (t)+d T w(t), 

where r, y = feWf, % = fe 2 ,$ 3 ,-,$r] T and c a T = [l,0,-,0], c q = [c|,0,-,0,l] T • % and C^ 
are a vector and a matrix with appropriate dimensions, respectively. Further, A^ is given by 
the form of 



A u, 


" 





QnJ 
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Since A„ and CL are stable matrices, A n is a stable matrix. 



3.2 Virtual error system 

Now, consider a stable filter of the form: 



z Cf (t) = A Cf z Cf (t) + c Cf u fi (t) 

u f (t) = e T z Cf W+u fi (t), 



(8) 



where c c = [0,--,0,l] T and 



A,. = 



I„ 



m-lxm-1 

eT =[ a o-Pc >"va m -i-Pc m J- 



3^ ,$r ,'-,$r are chosen such that A r is stable. 

Let's consider transforming the system (7) into a one with Uf given in (8) as the input. Define 
new variables Xi and X 2 as follows: 



X 1 =§f n > + a in _ 1 §i ,n - 1 ) + - + a 1 ? 1 + a ? 1 
x 2 = 1y m) + a m -i4 m_1) + • • • + a i*ly + a 0«ly- 



(9) 



Since it follows from the Cayley-Hamilton theorem that 



A m +rr A n 



-a 1 A m + a I = 0, 



(10) 



we have from (2) and (7) that 



X 1 (t) = a z X 1 (t)+c 1 T X 2 (t)+b z u f (t) 
X 2 (t) = A r X 2 (t) + c n X 1 (t), 



(11) 



where 



,(m) 



.(m-l) 



u f = u fi ' + a m _ a u fi ' + • • • + a a u fi + a u fi 



(12) 



Further we have from (10) that 



e (m) + a m _ 1 e (m - 1) + • • • + a a e + a e = X 1 . 



(13) 
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Therefore defining E = [e,e,---,e^ m ^] , the following error system is obtained: 



E(t) = A E E(t)+X 1 (t) 

X 1 (t) = a z X 1 (t)+c 1 T X 2 (t) + b z n £ (t) 

X 2 (t) = A n X 2 (t) + S X 1 (t) 

e(t) = [l,0,-,0]E(t). 



(14) 



Obviously this error system with the input u f and the output e has a relative degree of m+1 

and a stable zero dynamics (because A^ is stable). 

Furthermore, there exists an appropriate variable transformation such that the error system 
(14) can be represented by the following form (Isidori, 1995): 







" 


~ 






r o 


z e (t 


) = A Zc z e (t) + 


Jv 


u f (t 


)+ 


c T 

z e 






[0" 






1z e 


;t)=Qz e iz e (t)+ 


1 


z e,l 


t) 


e(t) 


= z ei (t), 













%.(t) 



(15) 



where z e = [z e ,---,z e J T and r| z g R n 1 . Since the error system (14) has stable zero 
dynamics, Q z is a stable matrix. 

Recall the stable filter given in (8). Since we have from (8) that 



A™). 



■K-! u 



(m-1) 

f 



: u f m) + a m-l u f m 2) + ' ' ' 4 a; i ll f. ] a n u 



P Cl u f +(3 Co u f 



(16) 



the filter's output signal Uf can also be obtained from 



Zc(t) = A c z C( (t)H 



u £ (t) 



u f (t)=[lA-,0]z C( (t) 

by defining z c = [u f ,u f ,---,uj m ~ 'J .Using this virtual filter signal in the variable 

transformation given in (6), the error system (15) can be transformed into the following form, 
the same way as the virtual system (7) was derived, with Uf as the input. 



e(t) = a e e(t)+b e u f (t)+cTi, e (t) 
fl e (t) = Q e i, e (t)+b n e(t), 



(17) 
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where 











Virtual error system 
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Fig. 2. Virtual error system with an virtual internal model 



A c, 


~ 

c T 

Z e 





Q ^e. 



Since A c and Q z are stable matrices, Q e is a stable matrix. Thus the obtained virtual error 
system (17) is ASPR from the input Uf to the output e. 

The overall configuration of the virtual error system is shown in Fig.2. 

4. Adaptive Controller Design 

Since the virtual error system (17) is ASPR, there exists an ideal feedback gain k* such that 
the control objective is achieved with the control input: u f (t) = -k*e(t) (Kaufman et al., 1998; 
Iwai & Mizumoto, 1994). That is, from (8), if the filter signal u f can be obtained by 



u fi (t) = -k'e(t)-e T z C( (t), 



(18) 



one can attain the goal. Unfortunately one can not design u f directly by (18), because u f is 

a filter signal given in (8) and the controlled system is assumed to be unknown. In such 
cases, the use of the backstepping strategy on the filter (5) can be considered as a 
countermeasure. However, since the controller structure depends on the relative degree of 
the system, i.e. the order of the filter (5), it will become very complex in cases where the 
controlled system has higher order relative degrees. Here we adopt a novel design strategy 
using a parallel feedforward compensator (PFC) that allows us to design the controller 
through a backstepping of only one step (Mizumoto et al., 2005; Michino et al., 2004). 



4.1 Augmented virtual filter 

For the virtual input filter (5), consider the following stable and minimum-phase PFC with 
an appropriate order nf : 
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yf(t)=-a £l y f (t)+a£r, f (t)+b a u(t) 

%(t) = A f ri f (t)+b f y £ (t), 



(19) 
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Fig. 3. Virtual error system with an augmented filter 

where y f e R is the output of the PFC. Since the PFC is minimum-phase Af is a stable 

matrix. 

The augmented filter obtained from the filter (5) by introducing the PFC (19) can then be 

represented by 



z Uf (t) = A Zf z Uf (t) + b Zf u(t) 
u af (t) = c z T f z Uf (t) = u fi (t) + y f (t), 



(20) 



where z u = [z^,y f ,r|J J T and 



K 





~ 




X 





-% 


a f 2 


' z f 


\ 





bf 


A f_ 








c* =[c Uf/ l,0,-,0] 



Here we assume that the PFC (19) is designed so that the augmented filter is ASPR, i.e. 
minimum-phase and a relative degree of one. In this case, there exists an appropriate 
variable transformation such that the augmented filter can be transformed into the following 
form (Isidori, 1995): 

iia £ (t) = a ai u af (t)+aj 2 i| a (t)+b a u(t) 
"0" 



i|a(t) = A a I| a (t)H 



*a £ (t), 



where A a is a stable matrix because the augmented filter is minimum-phase. 
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Using the augmented filter's output u a , the virtual error system is rewritten as follows (see 
Fig.3): 

e(t) = a e e(t)+ b e (u 3f (t)+ 6 T z Cf (t)- y f (t))+ cjq e (t) 

4(t)=QeIle(t)+b n e(t). 

4.2 Controller design by single step backstepping 

[Pre-step] We first design the virtual input a t for the augmented filter output u a in (21) as 

follows: 

a 1 (t) = -k(t)e(t)-e(t) T z C( (t) + %(t) / (22) 

where k(t) is an adaptive feedback gain and 0(t) is an estimated value of , these are 
adaptively adjusted by 



k(t) = Yk e 2 (t)-o k k(t) / Yk >0,o k >0 

e(t) = r e z Cf (t)e(t)-o e e(t), rj =r e >o,o e >o. 



(23) 



Further, W (t) is given as follows: 



, i , (t)=D(y f )(-a fi W (t) + b a u(t)) 

D ( yf)= f°' if w* 6 * (24) 



1, if y f >5 



Yi 



where 5 is any positive constant. 

Now consider the following positive definite function: 

V ° =^e 2 + -^Ak 2 +iAe T r e - 1 Ae + iiJP e ii e , (25) 

2b e 2 Yk 2 

where 

Ak = k(t)-k\ A9 = e(t)-e, 

k* is an ideal feedback gain to be determined later and P e is a positive definite matrix that 
satisfies the following Lyapunov equation for any positive definite matrix R e . 

P e Q e +Q^P e =-R e <0. 
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Since Q e is a stable matrix, there exists such P e . 
The time derivative of Vo can be evaluated by 

V < -(k* - v ) e 2 - (A min [R e ] - Pi ) ||ile If + w i e 
-{y f -W }e-^-p 2 JAk 2 
-(a e A min [r e - 1 ]-p 3 )A8 2 + R 
with any positive constant p 2 to p 3 . Where Wj = u a - a a and 

||c | + 2|P e |||b 11 |b e ) 2 
4b* Pl 



R = 



4p 2 Yk 



4p 3 



(26) 



(27) 



[Step 1] Consider the error system, (a 1 -system, between u a and a a . The co 1 -system is 
given from (21) by 



<X 



= *««« +<^+^-«l- 



(28) 



The time derivative of a a is obtained as follows: 



dcii dcii rtT dai , da-, T 

a-, = — L a p e + — L 8, z f + — L b p u f + — l c*il 
oe oe * oe oe 

■ aa i i: . aa i ^ .^li 
ak(t) az Cf c < ae 

+ D(y f )(-a f T 0+ b a u), 

where 0^ = b e T . Taking (28) and (29) into consideration, the actual control input is 
designed as follows: 



(29) 



b aYf 
J 

~b 



I 2 II l|2 i 
Ciffli+Eo U a l +||q a || \(D 1 +E 1 W 1 (D 1 -W 2 



if |yf|^6 yf 

Cl©l + e o|J u a f | + lllaf J^l + £ 1^1®1 - ^2 
1 [ I, ,|2 

— [YfYf + " 



(30) 



c 2|l%ll y f -TT^-^O 2 / if |yf| >6 y f 
b a y f 
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where £ to £ 3 and y f are any positive constants, and W 1 and W 2 are given by 



^ 



dk 


2 ,-,2 

k + 




2 


e 


2 
+ 


da 1 


9Z c f 



FcJ + 1 



1Tr da t A 9a a - T aaj- - f da 1 i 



where 1 is any positive constant and d e ,b e ,0! ,p x are estimated values of a e ,b e ,0! ,p x , 
respectively, and adaptively adjusted by the following parameter adjusting laws. 



da-, 



a e (t) = -Ya^i(t)- i e(t)-o a a e (t) 
9e 

b e (t) = - Yb a, 1 (t)^u fi (t)-o b b e (t) 

e 1 (t)=-r ei z c ,(t)^« Bl (t)-oe i e 1 (t) 

p l (t) = Ypi « Bl (t)<^] 2 -o Pi p 1 (t) 



(31) 



where y a , y b ,\q ,o a ,o h ,o Qi ,Oa are any positive constants and r e =^q 1 > . 



4.3 Boundedness analysis 

For the designed control system with control input (30), we have the following theorem 
concerning the boundedness of all the signals in the control system. 

Theorem 1 Under assumptions 1 to 3 on the controlled system (1), all the signals in the 
resulting closed loop system with the controller (30) are uniformly bounded. 

Proof: Consider the following positive and continuous function Vi. 



Vj: 



v +Icd? +lAeTr 9 - 1 Ae 1 +-^Aa 2 e 



2Y« 



— Ab^- 
2 Yb 



2y Pl 



Ap?+-8 y . if U<8 y 



V +- ffl J +-A9 1 1 r 9 - 1 Ae l +— Aa 2 e 

+ J-Abl+-^-Af 1+ ^y 2 , if |y f |>5 y 
2 Yb 2 Yp, 2 



(32) 



76 



Adaptive Control 



where 



Aa e = a e (t) - a e , Ab e = b e (t) - b e 
A^^t)-^, Afo=fo(t)-fo, 

and 5 is any positive constant. 

From (26) and (32), the time derivative of Vi for |y f | < 5 can be evaluated by 

V, < -fk* - v --LV -(X min [R e ]-Pi -Vo)hef 



4e1 



— "P2 

Yk 



Ak 2 -(o 9 x min [r 9 - 1 ]-p 3 )Ae 2 



-cico 2 -(oe 1 x min [r e ; 1 ]-ii 1 )Ae 2 



^-p 2 |Aa 2 

Ya 



^PslAb 2 



Yb 



Oft 



YPx 



F4 



Ap?-(y f -'Fo(y f ))e + R 1 



(33) 



with any positive constants u to u 4 . Where 



R 1= R 



4b, 



4ft 



o 2 p 2 

4 "2Ya " 4usYb " 4u 4Y p ' 



llfl ||2 , °a a e , °b b e 



Here we have 



-(y f -^o)e = -^e-!Xi±%!l + (ZlZ^ + U5e2 



2p 5 



4u 5 



(34) 



with any positive constant u 5 . Furthermore, for |y f I < 5 , since ^(t) = is held, there 
exists a positive constant W M such that |y f (t)- W (t)| < W M . 

Therefore the time derivative of Vi can be evaluated by 



V^-a^+Ri 



(35) 



for y f < 5 V , where 
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cu = mm 



s a = mm 



2b e |k*-v - — -p 5 |,s a ,2 



Kin^eY 



Pi-Po 



2y t 



Yk 



"P2 



^max L e J 

zc l> z ~ " ' Z Ya 



^QflX^^l-pa) 
Vac[ r e _1 J 



V-2 
vYa 



/ 



/ 



2Yi 



^"P3 

Yb 



* 2 Y(3 



Yp a 



P 4 



^ 



M , c2 



4u 5 



For Ivf I > 5„ , the time derivative of Vi is evaluated as 

\j t y f ' 



Vk * -[k* - v -^je 2 -(X min [R e ]- Pl -Po)Kf 
Ak^ioeX^Jre^l-pj^-qco 2 

/ 

-(oe i x min [r e - i 1 ]- V i 1 )Ae 2 . 



— "P2 

Yk 



Ya 



P2 



Aaf 



Yb 



V3 



Ab 



YPa 



^4 



A(3J +R a 



-a fi y f +a f2 i] f y f -Yfyf — e 2 ||i| f || y f -£3^ 
-V e-y f e, 



and thus we have for y f > 5 that 



(36) 



where 



Qu = mm 



2b e 



V a <-a b V a +R 2 , 






4el a £j 4e 3 



' s a' 2 Yf 



R 2 =R X +J 



4e, 



(37) 



(38) 
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Finally, for an ideal feedback gain k* which satisfies 



k > v H hv a , v 1 =max\i 5 , 

4el 



the time derivative of Vi can be evaluated by 

V 1 <-aV 1 +R, 



% 4£ 3 



(39) 



where a = min[a a ,a b ] > 0, R = max[R 1 ,R 2 J . Consequently it follows that Vi is uniformly 
bounded and thus the signals e(t),G) 1 (t),ri e (t),y f (t),r| f (t) and adjusted parameters k(t),0(t), 
a e (t),8 1 (t),b e (t),p 1 (t) are also uniformly bounded. 

Next, we show that the filter signal z c and the control input u are uniformly bounded. 
Define new variable z F as follows: 



4: )+ pc„_ 1 4r 1)+ - + pc ^=? 1 

?i = a z?i + 4 n y + b z u fi + c\ w 
i| y =A I ,n y + b I1 § 1 + C dii w, 
where ^ and i] y have been given in (7). Further define z« by 

z Pl =a z z Pi +b z z Cf +n Pi 



< )+ Pc m . 1 <- 1) +-+Mp 1 = c s T i y + ^ w ' 



(40) 
(41) 
(42) 

(43) 
(44) 



where z Cfi = [l,0,---,0]z C/ and we set z£>(0) = z£>(0),k = 0,-,m .We have from (40) and (41) 
that 



(m) 



§ 1 -a z ? 1 =z^ +1 » + (p Cm _ i -a z )z<7 

+ (pc m . 2 -a z p Cm J4r 1)+ - 

+ (Pc -a z Pc 1 K 1 -a z z ?1 

= bzU fl +4n y + 3 d r i w. 



(45) 



Further, we have from (43), (44) and (8) that 
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+ - + (pc -a z Pc 1 H i -a z z ( 3 i (46) 

= b z u fi +c| , i| y +^ , i w. 

It follows from (45) and (46) that z^ } = zj£ } , k = / -,m. 

Define z § = ^^.-^f and z p = [z^,^,-.^^] 1 . Since s m + p Cm . 1 s m - 1 + - + (3 Co 
is a stable polynomial, we obtain from (40) that 

IMHHI- 1 ^!^' < 47 > 

with appropriate positive constants h, I2. From the boundedness of w(t) and e(t), we have 
§i(t) is bounded and thus z« is also bounded. 

Furthermore defining % = [ru ,% /--/ilp J / we have from (44) that 



ilp(t) = A Cf rip(t)H 

From (8) and (48), we obtain 

b z Zc ( (t)+np(t) 

= A Cf (b z z Ci (t)+i, p (t))+b z 



(4n y W + cJw(t)). 



(48) 



u fl (t)n 



(c£r, y (t) + c d T w(t)). 



(49) 



Therefore b z z (t)+ %(t) can be evaluated from (48) and the fact that u f =a> 1 +a 1 -y ( by 



|Mc f W+ 1p(t)| * |A C( ||b z z Cf (t)+ i, p (t)| 

+ b z |a 1 (t)-y £ (t)| + b z |a> 1 (t)| 

+ |cJk(t)|| + |c dl ||w(t)|. 



(50) 



Here, we have from (22) that 

a 1 (t)-y f (t) = -k(t)e(t)-^{b z z C( (t) + r, p (t)} + ^ I , p (t) + %(y f )-y f (t). (51) 
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Since it follows from (19) and (24) that 

y f (t)-W (t) = -a fi (y f (t)-Wo(t))+a^ii f (t) 



(52) 



for |y f | > 5 and from the boundedness of r| f (t), there exists a positive constant such that 
|y f -W (t)| < W M . Further, from the boundedness of w(t) and e(t) i.e. ^(t), we can confirm 
that r| (t) and %(t) are also bounded from (7) and (48). Finally, taking the boundedness of 

the signals e(t), oo 1 (t),k(t) / 8(t)and r\ (t),%(t) into consideration, from (50) b z z c (t)+ f|g(t)can 
be evaluated by 



||b z z Cf (t)+ fi p (t)|| < l Zi |b z z Cf (t)+ i|p(t)| + 1 Z2 (53) 

with appropriate positive constants l z and l z . Consequently, considering the system: 

Zp(t) = a z z p +b z z Cf +n p (t) (54) 

from (44) with b z z c + %(t) as the input and z« as the output, since this system is 

minimum-phase and the inequality (53) is held, we have from the Output/ Input L p Stability 
Lemma (Sastry & Bodson, 1989) that the input b z z c +%(t) in the system (54) can be 

evaluated by 



|kz Cf (t)+rip(t)|<l Zi |zp(t)|+I z 



(55) 



with appropriate positive constants 1 and 1 . From the boundedness of zJt) and %(t), 

we can conclude that z c (t) is uniformly bounded and then the control input u(t) is also 

uniformly bounded. Thus all the signals in the resulting closed loop system with the 
controller (30) are uniformly bounded. 

5. Simulation Results 

The effectiveness of the proposed method is confirmed through numerical simulation for a 
3rd order SISO system with a relative degree of 3, which is given by 



(56) 





-1 -0.5 


0.5 









1 0.1 0.1 0.1 


z = 


1.5 -2.5 


-0.5 


z + 





u + 


0.1 0.1 1 0.1 




-2.5 0.5 


1 




1 




0.1 0.1 0.1 1 


y = 


z a + [0.1 0.1 


1 0. 


l]w 









where w is an unknown disturbance which has the following form: 
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sin(2t) 
2cos(2t) 
0.5sin(5t) 
2.5cos(5t) 



Before designing a controller, we first introduce the following pre-filter: 

_b_ 

s + a 



(57) 



(58) 



in order to reduce the chattering phenomenon to be expected by switching the controller 
given in (30). Therefore, the considered controlled system has a relative degree of 4. 

Since the relative degree of the controlled system is 4, we consider a 3rd order input virtual 
filter in (5). Further we consider a stable internal model filter (8) of the order of 4. 

For the input virtual filter, in this simulation, we consider a first order PFC: 

Yi =-a fi y f +b a u 

in order to make an ASPR augmented filter. 

The design parameters for the pre-filter (58), the input virtual filter (5) and the internal 
model filter (8) are set as follows: 

a = b = 1000 

p 0= 15,p 1= 75,p 2 =125 

P Co =20,p Ci =150,p c2 =500,p c3 =625 

and the PFC parameters are set by 

a f = 10,b a =0.01. 

h a 

Further design parameters in the controller given in (23), (24), (30) and (31) are designed by 



y k = 500,o k = 0.01,6 y 
1 = 0.5,00 =0.05,o e =o a =. 



:10 



'Pi 



= 0.1 



r0=r ei =5OOOI 4 ,Y a = Y b =Y p =lOO 
c a = 1000,£ = e 1 = £ 2 = 0.01,83 = Yf = 10 °- 

Figure 4 shows the simulation results with the proposed controller. In this simulation, the 
disturbance w is changed at 50 [sec]: 
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sin(2t) 
2cos(2t) 
0.5sin(5t) 
2.5cos(5t) 



2sin(4t) 

4cos(4t) 

0.5sin(20t) 

2.5cos(20t) 



Figure 5 is the tracking error and Fig.6 shows the adaptively adjusted parameters in the 
controller. 
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Fig. 4. Simulation results with the proposed controller 
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Fig. 5. Tracking error with the proposed controller 
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Fig. 6. Adaptively adjusted parameters 
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A very good control result was obtained and we can see that a good control performance is 
maintained even as the frequencies of the disturbances were changed at 50 [sec]. 

Figures 7 and 8 show the simulation results in which the adaptively adjusted parameters in 
the controller were kept constant after 40 [sec]. After the disturbances were changed, the 
control performance deteriorated. 
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Fig. 7. Simulation results without adaptation after 40 [sec] 
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6. Conclusions 

In this paper, the adaptive regulation problem for unknown controlled systems with 
unknown exosystems was considered. An adaptive output feedback controller with an 
adaptive internal model was proposed for single input/ single output linear minimum phase 
systems. In the proposed method, a controller with an adaptive internal model was 
designed through an expanded backstepping strategy of only one step with a parallel 
feedforward compensator (PFC). 
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1. Introduction 

Robots today have an ever growing niche. Many of today's robots are required to perform 
tasks which demand high level of accuracy in end effector positioning. The links of the robot 
connecting the joints are large, rigid, and heavy. These manipulators are designed with 
links, which are sufficiently stiff for structural deflection to be negligible during normal 
operation. Also, heavy links utilize much of the joint motor's power moving the link and 
holding them against gravity. Moreover the payloads have to be kept small compared to the 
mass of the robot itself, since large payloads induce sagging and vibration in the links, 
eventually bringing about uncertainty in the end effector position. In an attempt to solve 
these problems lightweight and flexible robots have been developed. These lightweight 
mechanical structures are expected to improve performance of the robot manipulators with 
typically low pay load to arm weight ratio. The ultimate goal of such robotic designs is to 
accurate tip position control in spite of the flexibility in a reasonable amount of time. Unlike 
industrial robots, these robot links will be utilized for specific purposes like in a space 
shuttle arm. These flexible robots have an increased payload capacity, lesser energy 
consumption, cheaper construction, faster movements, and longer reach. However, link 
flexibility causes significant technical problems. The weight reduction leads the manipulator 
to become more flexible and more difficult to control accurately. The manipulator being a 
distributed parameter system, it is highly non-linear in nature. Control algorithms will be 
required to compensate for both the vibrations and static deflections that result from the 
flexibility. This provides a challenge to design control techniques that: 

a) gives precise control of desired parameters of the system in desired time, 

b) cope up with sudden changes in the bounded system parameters, 

c) gives control on unmodeled dynamics in the form of perturbations, and 

d) robust performance. 

Conventional control system design is generally a trial and error process which is often not 
capable of controlling a process, which varies significantly during operation. Thus, the quest 
for robust and precise control led researchers to derive various control theories. Adaptive 
control is one of these research fields that is emerging as timely and important class of 
controller design. Area much argued about adaptive control is its simplicity and ease of 
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physical implementation on actual real-life systems. In this work, an attempt has been made 
to show the simplicity, ease and effectiveness of implementation of direct model reference 
adaptive control (DMRAC) on a multi input multi output (MIMO) flexible two-link system. 
The plant comprises of a planar two-link flexible arm with rotary joints subject only to 
bending deformations in the plane of motion. A payload is added at the tip of the outer link, 
while hub inertias are included at actuated joints. The goal is to design a controller that can 
control the distal end of the flexible links. 

Probably the first work done pertaining to the control of flexible links was presented by 
(Cannon & Schmitz, 1984). Considering a flexible link, which was only flexible in one 
dimension (perpendicular to gravity), a Linear Quadratic Gaussian controller was designed 
for the position control. Direct end point sensing was used and the goal was to execute a 
robot motion as fast as possible without residual vibrations in the beam. Also, experiments 
were carried out on end point control of a flexible one link robot. These experiments 
demonstrated control strategies for position of one end to be sensed and precisely 
positioned by applying torque at the other end. These experiments were performed to 
uncover and solve problems related to the control of very flexible manipulators, where 
sensors are collocated with the actuators. 

(Geniele et al., 1995) worked on tip-position control of a single flexible link, which rotates on 
a horizontal plane. The dynamic model was derived using assumed-modes method based 
on the Euler-Bernoulli beam theory. The model is then linearized about an operating point. 
The control strategy for this non-minimum phase linear time varying system consisted of 
two parts. The first part had an inner stabilizing control loop that incorporates a 
feedforward term to assign the system's transmission zeros at desired locations in the 
complex plane, and a feedback term to move the system's poles to the desire positions in the 
left half plane. In the second part, the other loop had a feedback servo loop that allowed 
tracking of the desired trajectory. The controller was implemented on an experimental test 
bed. The performance was then compared with that of a pole placement state feedback 
controller. 

(Park & Asada, 1992) worked on an integrated structure and control design of a two-link 
non-rigid robot arm for the purpose of high speed positioning. A PD control system was 
designed for the simple dynamic model minimizing the settling time. Optimal feedback 
gains were obtained as functions of structural parameters involved in the dynamic model. 
These parameters were then optimized using an optimization technique for an overall 
optimal performance. 

(Lee et al., 2001) worked on the adaptive robust control design for multi-link flexible robots. 
Adaptive energy-based robust control was presented for both close loop stability and 
automatic tuning of the gains for desired performance. A two-link finite element model was 
simulated, in which each link was divided into four elements of same length. The controller 
designed was independent of system parameters and hence possessed stability robustness 
to parameter variations. 

Variations in flexible links have also been researched. Control of a two-link flexible arm in 
contact with a compliant surface was shown in (Scicliano & Villani, 2001). Here, for a given 
tip position and surface stiffness, the joint and deflection variables are computed using 
closed loop inverse kinematics algorithm. The computed variables are then used as the set 
points for a simple joint PD control, thus achieving regulation of the tip position and contact 
force via a joint-space controller. 
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(Ider et al., 2002) proposed a new method for the end effector trajectory tracking control of 
robots with flexible links. In order to cope with the non-minimum phase property of the 
system, they proposed to place the closed-loop poles at desire locations using full state 
feedback. A composite control law was designed to track the desired trajectory, while at the 
same time the internal dynamics were stabilized. A two-link planar robot was simulated to 
illustrate the performance of the proposed algorithm. Moreover the method is valid for all 
types of manipulators with any degree of freedom. 

(Green, A. & Sasiadek, J., 2004) presented control methods for endpoint tracking of a two- 
link robot. Initially, a manipulator with rigid links is modeled using inverse dynamics, a 
linear quadratic regulator and fuzzy logic schemes actuated by a Jacobian transpose control 
law computed using dominant cantilever and pinned-pinned assumed mode frequencies. 
The inverse dynamics model is pursued further to study a manipulator with flexible links 
where nonlinear rigid-link dynamics are coupled with dominant assumed modes for 
cantilever and pinned-pinned beams. A time delay in the feedback control loop represents 
elastic wave travel time along the links to generate non-minimum phase response. 
An energy-based nonlinear control for a two-link flexible manipulator wasstudied in (Xu et 
al., 2005). It was claimed that their method can provide more physical insights in nonlinear 
control as well as provide a direct candidate for the Lyapunov function. Both simulation and 
experimental results were provided to demonstrate the effectiveness of the controllers 
A robust control method of a two-link flexible manipulator with neural networks based 
quasi-static distortion compensation was proposed in (Li et al., 2005). The dynamics 
equation of the flexible manipulator was divided into a slow subsystem and a fast 
subsystem based on the assumed mode method and singular perturbation theory. A 
decomposition based robust controller is proposed with respect to the slow subsystem, and 
H M control is applied to the fast subsystem. The proposed control method has been 

implemented on a two-link flexible manipulator for precise end-tip tracking control. 
In this work a direct adaptive controller is designed and the effectiveness of this adaptive 
control algorithm is shown by considering the parametric variations in the form of additive 
perturbations. This work emphasizes the robust stability and performance of adaptive 
control, in the presence of parametric variations. This approach is an output feedback 
method, which requires neither full state feedback nor adaptive observers. Other important 
properties of this class of algorithms include: 

a) Their applicability to non-minimum phase systems, 

b) The fact that the plant (physical system) order may be much higher than the 
order of the reference model, and 

c) The applicability of this approach to MIMO systems. 

Its ease of implementation and inherent robustness properties make this adaptive control 
approach attractive. 

2. Mathematical Modeling of the System 

In this section mathematical model of the system is derived using Lagrange equations with 
the assumed-modes method. The links are assumed to obey Euler-Bernoulli beam model 
with proper boundary conditions. A payload has been added to the tip of the second link, 
while hub inertias are included at the actuator joints. 
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2.1 Kinematic Modeling 

A planar two-link flexible arm with rotary joints subject to only bending deformations in the 
plane of motion is considered. The following coordinate frames, as seen in Fig. 1, are 
established: the inertial frame (x ,y)> the rigid body moving frame associated to link i 

(x ,Y.)f an d the flexible body moving frame associated with link i (x.,Y.) (Brook, 1984). 




Fig. 1. Planar Flexible Two-Link Arm 

The rigid body motion is described by the joint angle, , while y.(x.) denoted the 

transversal deflection of link i at abscissa, < x. < /. , I being the link length. Let 

p l .{x.) = (x .,y (x .)) T be the position of a point along the deflected link i with respect to frame 

(X.,Y.) and pi be the absolute position of the same point on frame (X Q ,Y Q )- Also, r l = />'(/.) 

indicates the position of the origin of frame ( X { ,Y. ) with respect to frame (X.,Y.), and r z 

gives absolute positioning of the origin of frame (X.,Y.) with respect to frame (X Q ,Y Q )- The 

rotation matrix A; for rigid body motion and the rotation matrix Ei for the flexible mode are, 
respectively, 



A, 



cos 0. - sin 0. 
sin 0. cos 0. 



1 

y'te 



1 



(1) 



where y' ie = (S ■ / 5 xi ) \ x =l and for small deflections arctan(y' e )D y' k . Therefore, the previous 
absolute position vectors can be expressed as, 



= r 1+ W, p. 



E, = r. +1 = r[ +W. r. +1 



(2) 



where, W i ^is the global transformation matrix from (X ,Y ) to (X.,Y.) f which obeys the 
recursive equation W { = W^E^A^ = VV^_ 1 A i and W =I 
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2.2. Lagrangian Modeling 

The equations of motion for a planar n-link flexible arm are derived by using the Lagrange 
equations. The total kinetic energy is given by the sum of the following contributions: 



r =IX + Z^ +7 ; 



(3) 



i-i i-i 



where the kinetic energy of the rigid body located at the hub i of mass m hi and the moment 
of inertia J hi is 



T l •^ 1 T * 



(4) 



where d { is the (scalar) absolute angular velocity of frame (X.,Y.) given by 



7=1 k=\ 



(5) 



Moreover, the absolute linear velocity of an arm is 

p = r i+ W i p l i+ W l i p i 



(6) 



and f- +1 = p\(li) . Since the links are assumed inextensible ( x { = ) , then /?-(x ; ) = (0, ]/,•(*,• )) T 
The kinetic energy pertaining to link i of linear density p i is 



T ii=-[ Q PX x i)Ph x i) dx i 



2 Jo 



(7) 



and the kinetic energy associated to a payload of mass m and moment of inertia J located 
at the end of link n is 



T p =\mf n+1 r n+i+ ^J p (a n+ y'J 



(8) 



Now, in the absence of gravity (horizontal plane motion), the potential energy is given by 



M' 



^=£^=£-1 (£'),(*,) 



i=i 1=1 2 



d 2 yX x i) 
dx; 



dx ; 



(9) 



Where \1 { is the elastic energy stored in link i, and (EI) { being its flexural rigidity. No 
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discretization of structural link flexibility has been made so far, so the Lagrangian will be a 
functional. 

2.3. Assumed Mode Shapes 

Links are modeled as Euler Bernoulli beams of uniform density pi and constant flexural 

rigidity (EI) f with the deformation y (x t , t) satisfying the partial differential equation 



{El) p^ + Pi d^x il t 1 = (j ._ l n 



dxt 



dr 



(10) 



Boundary conditions are imposed at the base of and the end of each link to solve this 
equation. The inertia of a light weight link is small compared to the hub inertia, and then 
constrained mode shapes can be used. We assume each slewing link to be clamped at the 
base 



^(o,o = o,^'(o,o = o,i = i w 



(ii) 



For the remaining boundary conditions it is assumed that the link end is free of dynamic 
constraints, due to the difficulty in accounting for time-varying or unknown masses and 
inertias. However, we consider mass boundary conditions representing balance of moment 
and shearing force, i.e. 



{EI\ 



(EI\ 



dl yX x i, t ) 






--Jn 



dt l 



dy i (x. , t) 

dx. 



-(MDX—iy^X^.) 

at ' ' 



dx: 



-M Li ^ J (y i (x i X= l )-(MD) i ^- 2 

at ' ' at 



dy t (x. , t) 

dx. 



(12) 



i = l,....,n 



where, M z/ and / z .are the actual mass and moment of inertia at the end of link z. (MD) f 

accounts for the contribution of masses of distal links, i.e. non-collocated at the end of link i. 
A finite-dimensional model of link flexibility can be obtained by assumed modes technique. 
Using this technique the link deflections can be expressed as 



7=i 



(13) 



where £..(0 are the time varying variables associated with the assumed spatial mode 
shapes (p..(x ) of link z. Therefore each term in the general solution of (10) is the product of a 
time harmonic function of the form 



^(0 = expO'^0 



(14) 
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and of a space eigenfunction of the form 

^(x,) = q.. sin(^.x-) + C 2 , cos(^x-) + C 3 , sinh(^.x-) + C 3 , cosh(^x-) (15) 



In (14) cd.. is the ;th natural angular frequency of the eigenvalue problem for link i, and in 
(i5)fi 9 =^ Pl /(EI) r 



Application of the aforementioned boundary conditions allows the determination of the 
constant coefficients in (15). The clamped link conditions at the link base yield 

q v =-C,^C 4 , v =-C 2 ,, (16) 

while, the mass conditions at the link end lead to homogeneous system of the form 



[>te)] 






(17) 



The so-called frequency equation is obtained by setting to zero the determinant of the (2x2) 
matrix F(j3 ) that depends on explicitly on the values of M L . , J L . > and (MD) ■ The first m i 

roots of this equation give the positive values of p.. to be plugged in (15). Using this the 

coefficients C and C, are determined up to a scale factor that is chosen via a suitable 

Uj 2,ij r 

normalization. Further the resulting eigenf unctions (p.. satisfy a modified orthogonality 
condition that includes the actual M Li , J L , and (MD) r In an open kinematic chain 
arrangement, M Li is the constant sum of all masses beyond link i, but J u and (MD) i 

depend on the position of successive links. This will considerably increase the complexity of 
model derivation and overload the computational burden of on-line execution. Thus, some 
practical approximation leading to constant although nonzero boundary conditions at the 
link end is done. Thus, a convenient position is set to (MD) = and compute J Li for a fixed 

arm configuration. In this case, it can be shown that det(F) = results in the following 
transcendental equation (De Luca & Scicliano, 1989) 

(i+cos(/y ; )cosh(/y^ 
? 3 



-^(sin(^/,.)cosh(^./,) + cos(^i,)sinh(^i,)) ( 18 ) 

.^4^(l-cos(/y>sh(/y,.)) = 



+ " 
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2.4. Closed-Form Equations of Motion 

On the basis of the discretization introduced in the previous section, the Lagrangian L 
becomes a function of set of N generalized coordinates c\i(t) the dynamic model is obtained 
satisfying the Lagrange-Euler equations 



d_ 
dt 



r dL^ 



dq t 



~ = fi. i = l-N (19) 

dq i 



where, fi are the generalized forces performing work on c\\(t). Under the assumption of 
constant mode shapes, it can be shown that spatial dependence present in the kinetic energy 
term (7) can be resolved by the introduction of a number of constant parameters, 
characterizing the mechanical properties of the (uniform density) links (De Luca, et. al. 1988, 
Cetinkunt, et. al, 1986) 

m,=J o */>A=M (20) 

1 Cli 1 

d.= — p.x.x. =— I. 
1 m t Jo l l l 2 l (21) 

rli 2 1 2 

J™ = Pi x i dx. = — ml. 

01 Jo l l l i n 



(22) 



li 

HYij\~ij—i p3) 



v ij = [ Pi%{ x i)dx i 



w ij=[ {) Pi ( PiA x iWx i (24) 

z ift = SlpM x i)<P*(Xi)dx i (25) 

k ijk = jj (EI)i<Py ( x i ) <Pik ( x i ) dx i (26) 

where, m is the mass of the link i, d is the distance of center of mass of link i from joint i 

axis, J is the inertia of link i about joint i axis, v and co are the deformation moments of 

order zero and one of mode j of the link i. Also, £ is the cross elasticity coefficient of 

modes j and k of link i. The actual numerical values of the previous parameters are 
calculated off-line. As a result of this procedure, the equations of motion for a planar n-link 
arm can be written in a familiar closed form 

B(q)q + h(q, q) + Kq = Qu (27) 
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where q = U ^ ^ ^ ^ S nl S nm f is the N-vector of generalized coordinates ( N = n + £ . m . ), 

and u is the n-vector of joint actuator torques. B is the positive definite symmetric inertia 
matrix, h is the vector of Corriolis and centrifugal forces, K is the stiffness matrix and Q is 

the input weighting matrix that is of the form \I O (N _ ,\ due to the clamped link 

assumptions. Joint viscous friction and link structural damping can be added as Dq , where 

D is a diagonal matrix. It is noted that orthonormalization of mode shapes implies 
convenient simplification in the diagonal blocks of the inertia matrix relative to the 

deflections of each link, due to the particular values attained by z.. k in (25). Also the 

stiffness matrix becomes diagonal (k = -- = K =0;K ,...,K > 0) being £ =0 for J ' ^ k 

in (27). The components of h can be evaluated through the Christoffel symbols given by 



* = II 



dB, \dB tt 



dq k 2 d qj 



Vflk 



(28) 



2.5. Explicit Dynamic Model of Two-Link Flexible Arm 

Two assumed mode shapes are considered for each link (m 1 =m 2 = 2 ). Thus, the vector of 

Lagrangian coordinates reduces to q = (6 x 6 2 8 n S l2 S 2l S 22 ) T > i- e - N = 6. It can be shown 

(Brook, 1984, De Luca et. al. 1988) that the contributions of kinetic energy due to deflection 
variables are 



[factor of Si) = z n 
M T , 



[factor of 2Sj i2 ) = [cp ile (p' ile ~\ 



t(MD\ 



[factor of 8f 2 } = z i2 



\(MD\ 



<Pi2,e 
<P'i2,e 



+ Z, 



(29) 

(30) 
(31) 



where, ^. e = (p.. ( x . ) |^^ and qj.. e =q> t y {x i ) \ x _ v ij = 1,2. The above equations are 

obtained expanding terms (7) and (8) by using (5) and (6). Accounting for separability (13) 
then leads to expressions for the factors of the quadratic deflection rate terms, in which 
parameters defined in (25) and the mass coefficients on the right hand side of (12) can be 
identified. It is found for link-1: 



M L1 =m 2 +m h2 +m p 



^Ll= J 02 +J h2 +J P +m P l 2 



(32) 
(33) 



(MD\ = ( m 2 d 2 + m p l 2 ) cos<9 2 - [ ( v 21 + m p cp 2le )S 2l + ( v 22 + m p cp 22e ) S 22 ] sin<9 2 ( 34 ) 
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Note that in the case of only two links, J Ll is a constant. On the other hand for link-2: 

M L2 =m p/ J L2 = J p , (MD) 2 = (35) 

A convenient normalization of mode shapes is accomplished by setting: 

z,,.=m,., i,7 = l,2 (36) 

This also implies that the nonzero coefficients in the stiffness matrix K take on values w 2 m . 

It is stressed that, if the exact values for the boundary conditions in (12) were used the 
natural orthogonality of the computed mode shapes would imply that { factor of 28 8 } is 

zero for both links. For link-2 the use of (35) automatically ensures the "correct" 
orthogonality of mode shapes. On the other hand, however for link-1, the off-diagonal term 
(MD), varies with arm configuration. This implies that the mode shapes- which are spatial 

quantities-would become implicit functions of time, thus conflicting with the original 
separability assumption. It is seen that for different positions of second link, (MD)i results in 
variations of (34), so the actual mode shapes of the first link become themselves functions of 
time-varying variables describing the deflection of the second link. A common 
approximation in computing the elements of the inertia matrix for flexible structures is to 
evaluate kinetic energy in correspondence to the undeformed configuration. In our case, it is 
equivalent to neglecting the second term (MD) { in (34), which is an order of magnitude 

smaller than the first term. Accordingly, (MD) is constant for a fixed arm configuration. 

Taking 2 = ±k/2 leads to (MD\ = and thus the eigen-frequencies can be computed 

through (19). This is equivalent to having zeroed only that portion of the {factor of 28 8 } 

generated by constant diagonal terms, i.e. 



[<Pu* <Pn,e] 



M Li 
J Ti 



<Pl2,e 
<Pu,e 



(37) 



This will produce nonzero off-diagonal terms in the relative block of the inertia matrix. The 
resulting model is cast in a computational advantageous form, where a set of constant 
coefficients appear that depend on the mechanical properties of the arm. The inertia matrix 
as well as other derivations can be found in (Miranda, 2004). Once having obtained the 
expressions of the inertia matrix, the components of h can be evaluated using (28). Viscous 
friction and passive structural damping are included in matrix D for improvement in arm 
movement, and finally, the stiffness matrix K is of the form, 

1 ?} ?} ?} 9 I 

K = diag }0, 0, zv^mi, zv i2 m 1, w 21 m 2' w 22 m 2 I (38) 

Then the equations of motion is given in its standard form as 

B(q)q + h(q,q)+ Dq + Kq = Qu (39) 
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After tremendous of algebra and neglecting friction, (39) can be written as, 



B n Si + B 12 02 + B 13 In + B 14 hi + B 15 hi + B 16 hi + h = u pl 

B 2 i h + B 2 2 0*2 + B 13 hi + B 14 hi + B 15 hi + B 16 hi + h 2 = u p2 

B 31 2 + B 32 2 + B 33 hi + B 34 hi + B 35 hi + B 36 hi + h 3 + K 3 5n = 

B41 h + B42 02 + B43 hi + B 44 hi + B 45 hi + B 46 hi +h4+K4 5i 2 = ° 

B 15 h + B 51 h + B 53 hi + B 54 hi + B 55 hi + B 56 hi + h 2 + K 5 S 2 i = 

B 61 h + B 61 h + B 63 hi + B 64 hi + B 65 hi + B 66 hi + h 2 + K 6 5 22 = 



(40) 



where, u p i and u v i are input torques to joints 1 and 2, respectively. Plant outputs are 
considered to be link tip displacements yi and 1/2. As seen from above equations, the system 
is highly nonlinear and of 12 th order. For the flexible robot, the following physical 
parameters were considered 

p 1 = p 2 = 0.2kg /m 

ll = l2 = 0.5 m, d2 = 0.25 m 
m\ = m 2 = wip = 0.1 kg, ra/^2 = 2 kg 



J01 = JOI = 0.0083 kgm z 
J hi = J h2 = OAkgm 2 , J = 0.0005 kgm 2 



(41) 



{EI) 1 = (EI) 2 = 1 Nm Z 

The natural frequencies fij = wi/2ji and the remaining parameters in the model coefficients 
are computed as (Miranda, 2004): 

f n = 0.48 Hz,f n = 1.80 Hz, 



f 2l = 2.18 Hz,f 22 = 15.91 Hz, 

<p ne = 0.186, <p l2e = 0.215, < e = 0.657, (p[ le = -0.560, 

<p 2le = 0.883, cp 22e = -0.069, (p' 2le = 2.641, <p' 22e = -10.853, 

v n = 0.007, v 12 = 0.013, v 21 = 0.033, v 22 = 0.054, 

w n = 0.002, w 12 = 0.004, w 2l = 0.012, w 22 = 0.016 



(42) 



In order to design the proposed adaptive controller, the plant needs to be linearized and the 
transfer function matrix be obtained. After linearization, neglecting higher order terms, and 
tremendous amount of algebra, it can be shown (Miranda, 2004) that the plant G po (s) = 
y p (s)/u p (s) with nominal parameters can be obtained as, 



G p0 (s)= 



y P i( s ) 
y p2 (s). 



0.01641s lu + 7.061* 10"V 



12 



10 



■r- -h 3.68 * 10" s- 
-7.357s 10 - 9.636 *10"V 



-2.259s 10 - 1.362 *10s 



1317 s 10 + 0.674s 8 



12 



5*10" 



u pi( s ) 

U P2 ( S )_ 



(43) 
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where, G po (s) is the nominal plant transfer function matrix. Now, performing minimal 
realization, G po (s) can be reduced to 



G po (& 



0.01641s 2 + 7.061 *1(P 
/+3.68*10"V 



-2.259s -1.362 * 10- 



s H +: 



/+3.68*10 s 
1317 s 2 + 0.674 

s 4 +2 



(44) 



From (44), it is straight forward to obtain the actual plant in general form as 



G P (& 



u p( s ) 



gV+C C,'V+(ff 
s*+Bl 2 s 2 



C 2l s 2 +Cf C 22 s 2 +Cf 
s 4 +B 2l s 2 



s'+Bfs 2 



(45) 



where, above coefficients of G p (s) are functions of plant parameters and can vary with the 
range as defined below: 



C lJ <C lJ .< C J . 

—p-J p-J p-J 

B ij < B ij < B ij 



i = l,2, 7=1,2 
i = h2, 7=1,2 



(46) 



The values of the nominal plant parameters are defined in the following table. The range 
considered for each parameter is ±30%. 



Parameter 


Nominal 


Range 


c 11 


0.01641 


0.011487 to 0.02133 


c 11 


7.061 *10 -5 


4.9427 * 10" 5 to 9.1 793 *10" 5 


Dll d12 d21 d22 

B 2 -B 2 -B 2 -B 2 


3.68 no -3 


2.576*10~ 3 to 4.784*10" 3 




2.259 


1.5813 to 2.9367 


c 12 


1.362 no -3 


9.583*10 -4 to 1.7797*10" 3 


c 21 


7.357 


5.1499 to 9.5641 


c 21 


9.636*10 -3 


6.7452*10"' to 12.5268 *10~ 3 


c 22 


1317 


921.9 to 1712.1 


c 22 


0.674 


0.4718 to 0.8762 



Table 1. Plant parameters, nominal values, and variation range. 
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For comparison reasons, uncompensated response of the nominal plant is given below 



Linear Plant Test 




500 1000 1500 2000 2500 3000 



Fig. 2. Uncompensated response of the nominal plant 



3. Controller Design 

Consider now that the plant given by (45) is represented by the following state-space 
equations: 



x p (t)=A p x p (t) + B p u p (t) 

y P {t)=c p x p {t) 



(47) 



where x p (t) is the (n xl) state vector, u p (t) is the (raxl) control vector, y p (£) is the (q x 1) plant 
output vector, and A p/ B p and C p are matrices with appropriate dimensions. The range of the 
plant parameters given by (46) is now given by 



al <a p (i,j)<aij,i,j = l,...,n 
b tj <b p (i,j)<bij,i,j = l,...,n 



(48) 



where a p (i, j) is the (z, ;) th element of A p and b p (i, j) is the (z, ;) th element of B p . Consider also 
the following reference model, for which plant output is expected to follow the model 
output without explicit knowledge of A p and B p . 



x m (t) = A m x m (t)+B m u m {t) 

y m (t)=c m x m (t) 



(49) 



In light of this objective, consider now the following output feedback adaptive control law, 
u p (0 = K e (t)e y (0 + K x (t)x m (0 + K u (t)u m (t) (50) 
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where e y (t) = ym(t)-y P (t) and K e (t), K x (t), and K u (t) are adaptive gains defined below. The 
control law consists of a feedback term from output error and a feedforward terms from 
model states and inputs. The adaptive gains K e (t), K x (t), and K n (t) are combination of 
proportional and integral gains as given below, 

K j (t) = K pj (t) + K ij (t) j = e,x,u (51) 

and they are updated according to the following adaptation law (Kaufman, et. al. 1998, 
Ozcelik & Kaufman, 1999) 

K pi (0 = e y (t)[e v (t) + x m (t) + u m (t)]T j = e,x,u,T>0 (52) 



K,j (t) = e y (t)[e y (0 + x m (0 + u m (1)% j = e,x,u, T^O (53) 

where T{ and T p are constant proportional and integral weighting matrices, respectively. It is 
seen from (53) that the term Ky(t) is a perfect integrator and may steadily increase whenever 
perfect following (e y (t) = 0) is not possible. The gain may reach unnecessarily large values, or 
may even diverge. Thus, a o-term is introduced in order to avoid the divergence of integral 
gains (Ionnou & Kokotovic, 1983). With the o -term, Ki(t) is now from a first-order filtering 
of e y (t)r T (t)T{ and therefore cannot diverge, unless e y (t) diverges. However, in this context, 
the o -term does more for the concept of 'adaptive control' . The gains increase only if high 
gains are needed and decrease if they are not needed any more. They are also allowed to 
change at any rate without affecting stability, such that the designer can adjust this rate to fit 
the specific needs of the particular plant. Thus, using o -term we rewrite the equation (53) as 
follows, 

K (J (0 = e y (t)[e y (0 + x m (0 + u m (t)]T t - aK y (t) j = e,x,u (54) 

For this adaptive control to work and for asymptotic tracking to be achieved, the plant is 
required to be almost strictly positive real (ASPR) (Bar-Kana, 1994); that is, there exists a 
gain matrix K e , not needed for implementation, such that the closed-loop transfer function 

G c (s) = [I + G p (s)KJ- 1 G ;) (s) (55) 

is strictly positive real (SPR). And that it can be shown that (Kaufman, et. al., 1998) a MIMO 
system represented by a transfer function G p (s) is ASPR if it: 

a) is minimum phase (zeros of the transfer function are on the left-half plane), 

b) has relative degree of m or zero (i.e., the difference in the degree of denominator 
and numerator, (n-m=m) or (n-m=0)), and 

c) has minimal realization with high frequency gain C V B V > (positive definite). 

Obviously, the plant given by (45) does not satisfy the so-called ASPR conditions and that 
can not be applied. However, it has been shown in (Kaufman, et. al., 1998) and (Ozcelik, 
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2004) that there exist a feedforward compensator H(s) such that the augmented plant G a (s) = 
G p (s) + H(s) is ASPR and the proposed adaptive algorithm can be implemented confidently. 

3.1. Design of a Feedforward Compensator (FFC) for the Flexible Robot 

From the above restrictions it is obvious that the plant given by (45) is not ASPR and that an 
FFC has to be designed. Now consider the actual plant G p (s) again, 



G„(s) = 



s'+Bl 



■C 



Cfs 2 +Cf Cfs 2 +C 



s + 1 



(56) 



Assuming that the nominal plant parameters are known, the parametric uncertainty of the 
plant can be transformed into a frequency dependent additive perturbation of the plant by 
representing of the actual plant G p (s) as G p (s) = G p o(s) + A a (s), with G p o(s) being a nominal 
plant and A a (s) being a frequency dependent additive perturbation. Then, one can write 



A a (s) = G(s)-GJs) 



(57) 



From (57), the additive uncertainty transfer function can be obtained as 

X 21 {s) A^(s) 



(58) 



where, 



M s ) 



A 12 (s)-- 



A 21 (s)-- 



A 22 (s) 



{Cf -0.01641^s 6 +(3.68*10 3 Q n + C n -7*10 5 - 0.01641B^y +(3.68*10 3 C n -7*10 ^BfY 









s 8 +(3.68*10 3 


+ Bfy +Bf 3.68' 


^lOV 














(2.25 + Cf)s 6 


+ (3.68*10 


3 Cf+C 12 +1.36 


*10 3 +2.25Bfy 


+ (3.68* 


10 


5 C 12 + 


1.36 * 


10 


X 


V 








s 8 +(3.68*10 3 


+ Bf> 6 +fr 1 3.68* 


10V 














(7.32 + Cf)s 6 


+ (3.68*10 


- 3 Cf+C 21 +9.63 


*10 3 +7.35B 2 2i y 


+ (3.68* 


10 


3 C 21 + 


9.63' 


'lO - 


3 B 2 2 


y 








s 8 +(3.68*10 3 


+ BfY +^3.68" 


'10 V 














{Cf 


-1317) 


s 6 +(3.68* 


10 3 Cf +Cf - 0.674 -1317B 21 }; 4 


+ (3.68 


*10 


-3^22 


-0.674B 21 ) 2 





s 8 +(3.68*10" 3 +Bf> 6 + B 2 21 3.68*l(rV 



(59) 



It is seen that the uncertainty is a function of plant parameters, which vary in a given range. 
Thus, in the design of a feedforward compensator, the worst case uncertainty should be 
taken into account. To this effect, the following optimization procedure is considered for 
determining the worst case uncertainty at each frequency (suitable number of discrete 
values). Define a vector whose elements are plant parameters, i.e. 
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= [c 



c tJ 



Qij T>ij T>ij 

^0 D r D r-\ 



n\ 



(60) 



Then 



maximize U- ? (jiv)\ at each w 

V 

subject to : 



c^.<q_ i <c;. i 



(61) 



where Xy is the z/ th element of A(jw). In other words, this optimization is performed for each 
element of A(jw). After having obtained the worst case (maximum) perturbation, we will 
assume that the perturbation is not exactly known but its upper bound is known. In other 
words, there exists a known rational function as an upper bound of the worst case 
uncertainty. Now the upper bound is characterized by an element by element interpretation, 
where the upper bound means that each entry of X(jw) is replaced by its corresponding 
bound. In other words, given the worst case uncertainty for each X(jw), it is assumed that 
there exists a known rational function Wij(s) G RHoo such that 



j(ji»]>max\^(jw]Vw 



(62) 



Knowing that the plant parameters can vary within their lower and upper bounds, this 
parametric uncertainty is formulated as an additive perturbation in the transfer function 
matrix. It is important to note that the controller be designed with respect to worst case 
uncertainty for each X,y. This can be achieved by performing an optimization procedure 
given by (61) for 200 frequencies. Here an element by element uncertainty bound model is 
used for the characterization of upper bound of the uncertainty matrix. Then wy , which 
satisfies (62) for each A y - is given in matrix form as, 



W(s)- 



7*10 2 



9*10 4 



800s 2 + 22s + 0.05 80s 2 + 4s + 0.05 



425*10 j 



2*10" 



150s 2 5.75s + 0.05 25s 2 + 3.75s + 0.125. 



(63) 



The magnitude responses for each max( | Xtj | ) and the corresponding ( | w%] \ ) are given in 
Figures 3-6. Having obtained the nominal plant and formulated unmodeled dynamics, let's 
have the following assumptions on the plant, 

Assumption 1: 

a) The nominal plant parameters are known. 

b) The off-diagonal elements ofG po (s) and A a (s) are strictly proper. 

c) A a (s) € RHoo mxm and satisfies (62) 
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Now, consider the augmented nominal plant with the parallel feedforward compensator 

G ao {s)=G po {s)+H{s) (64) 

and the following lemma 

Lemma 1: 

Let the feedforward compensator H(s) be of the form, 



H(s)- 



K 

h 22 




K 



(65) 



with each element hu(s) of a feedforward compensator being relative degree zero, then the augmented 
nominal plant G ao (s) = G po (s)+H(s) will have positive definite high frequency gain and relative 
McMillan degree zero (Ozcelik & Kaufman, 1999). 
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In other words, the new plant G ao (s) including H(s) becomes ASPR. Now that the ASPR 
conditions are satisfied for the nominal plant case, we next need to guarantee that the ASPR 
conditions are also satisfied in the presence of plant perturbations. To this effect, consider 
the following theorem 

Theorem 1: IfH(s) is designed according to the following conditions, then the augmented plant 
G a (s) = G p (s)+H(s) with the plant perturbations will be ASPR. 

a) H(s) is stable with each hjj (s) being relative degree zero 

b) H' 1 (s) stabilizes the nominal closed loop system 
c )A(s)eRH x and\\A(s)\\ <1 

where A(s) is the uncertainty of the augmented plant and given in the following (Ozcelik & 
Kaufman, 1999) 



A(s)=(G p0 (s)+H(s)Yw(s) 



(66) 



In light of the Theorem 1, we can readily determine the FFC as follows: 

a) The order of each element hu(s) of a feedforward compensator is chosen to be 
equal to the order of the corresponding diagonal element of the nominal plant 

G p0 (s). 



H(S): 



h l s A + h 2 s 3 + h 3 s 2 + h 4 s + h 5 

s 4 + 4s 3 + 6s 2 +4s + \ 





h 6 s 4 + h 7 s 3 + h s s 2 + h 9 s + /^ 

s 4 +4s 3 +6s 2 +4s + l 



(67) 



Denominator of the compensator H(s) appears in the numerator of the closed- 
loop transfer function and therefore, can be pre-determined such that its time 
constant is fast enough that its dynamics is negligible, 
b) Compensator parameters are determined from the following optimization 
procedure: 



minimize AQk;) 
x 

subject to : Re al \roots(Z(s))\ < 



(68) 



where Z(s) is the characteristic polynomial of the nominal closed-loop system matrix and X 
is a vector composed of the parameters of each Hjj(s). 



numerator of h = 3. 7444,- V + 4.6805e"V + 9.5966e"V + 1.1650e" 3 s + 4.03 10e" 4 
numerator of h 22 = 9.2748e" 5 ^ 4 + 5.7746e"V + 2.3495e"V + 2.9786e" 5 s + 5.6472e" 4 K } 
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With this FFC, all the conditions of Theorem 1 are satisfied. Thus, the augmented plant 
satisfies the almost strictly positive real conditions over a wide range of plant parameter 
variations. It is expected that the DMRAC augmented with this feedforward compensator 
will be robust for the maximum deviations from the nominal plant. The block diagram of 
the overall control system is depicted in Figure 7. 



His) : 



M 4 + ki3 3 + has 2 + hjs + h 5 
s 4 + 4s 3 + 6s 2 + As + 1 



h-QS 4 + /7.7 s 3 + hs 2 + hgs + hio 
s 4 + 4s 3 + 6s 2 + As + 1 



feedforward compensator 



-> i m (t) = A m x m (t) + B m u m (t) 
-> Vm(t) = C m x m (t) 



« P 2 



»/*] 



reference model 



Vrn\ 



Vm2 



u p (t) = K e (t)e y (t) + K x (t,)x m (t) + tf u (t)u m (J) 

K J (i) = K PJ (t.)+K ij (t) j = e,x,u 

Kpj(t) = e y (t}[e y (t}+x m (t) + u m (t)]T p , T p >0 

Kjj(t) = e y (t)[e y (i)+x m (t) + u m (t)}Ti. Ii > 

adaptive control law 



<"=1 



6x2 



ffW = 



/lis 4 + /i^s 3 + fo 3 s 2 + /l 4 S + /l 5 

s 4 + 4s 3 + 6s 2 +4s + l 



fees 4 + /17s 3 + h&s 2 + foj + fci 

5 4 + 45 3 + 65 2 + 45+1 



feedforward compensator 



u p i 



Up! 




Vpi 



plant 



Fig. 7. DMRAC with Two-Link Arm 
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Zm=Ym+Rm 




Zp=Yp+Rp 



Matrix 
Concatenation 



Fig. 8. Simulink Block Diagram of the overall control system with nonlinear plant. 



4. Simulation Results 

The nonlinear equations were used in building the plant in Simulink/ MATLAB (Figure 8) 
and with the above described DMRAC algorithm, the following cases were simulated. 



4.1 Case 1 



• All initial conditions were set to zero. 

• For both links the reference models were set to G ml (s) = G m2 (s) = l/(30s + 1) 

• For both links tip displacements were set to=0.05m 

• Upper bound of plant parameters was used. 
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Fig. 9. Case 1: yi and reference model (left), yi and reference model (right) 





150 200 250 300 350 4Q0 



Fig. 10. Case 1: 61 and O2 (left), errors e z i and e Z 2 (right) 




Fig. 11. Case 1: Control inputs, u v i and w P 2 



4.2. Case 2 



• All initial conditions were set to zero. 

• For both links the reference models were set to G ml (s) = G m2 (s) = 1/ {15s + 1) 

• For both links tip displacements were set to=0.05m 

• Lower bound of plant parameters was used. 
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Fig. 12. Case 2: yi and reference model (left), yi and reference model (right) 





Fig. 13. Case 2: 6i and 62 (left), errors e z i and e Z 2 (right) 




50 100 



250 300 350 400 



Fig. 14. Case 2: Control inputs, u v i and w P 2 

From the Figures 9-11 of Case 1 and the Figures 12-14 of Case 2, we can see that the tips of 
both links in each case follow the reference input. In Case 2 we have used a faster reference 
model to show the effectiveness of the DMRAC. 



4.3. Case 3 



6^(0) = 5°, 6> 2 (0) = 0° 
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• For both links the reference models were set to G ml (s) = G m2 (s) = l/(30s + 1) 

• For both links tip displacements were set to=0.01m 

• Nominal set of plant parameters were used. 





Fig. 15. Case 3: yi and reference model (left), 1/2 and reference model (right) 




100 150 200 250 300 350 




350 400 



Fig. 16. Case 3: 61 and 62 (left), errors e z i and e Z 2 (right) 

From Figure 15 we see that at t=0, the tip position of link 2, 1/2 begins at zero, however due to 
the initial condition for 6i tip position of first link, yi is displaced with respect to the desired 
reference model at t=0. From Figure 16 it is seen that 61 and di come to steady-state, while 
errors approach to zero. 




Fig. 17. Case 3: Control inputs, u p i and u V 2 
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4.5. Case 4 

•^(0) = o , 6> 2 (0) = 0° 

• For both links the reference models were set to G ml (s) = G m2 (s) = l/(50s + 1) 

• For both links tip displacements were set to=± 0.01m 

• Nominal plant parameters were used 





Fig. 18. Case 4: yi and reference model (left), 1/2 and reference model (right) 





Fig. 19. Case 4: 61 and 62 (left), error e z (right) 



Fig. 20. Case 4: Control inputs, u v i and u V 2. 
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5. Conclusions 

Direct Model Reference Adaptive Control is utilized to control two-link flexible robot, 
whose parameters vary. The feedforward compensator was designed for the system and it 
was showed that the augmented plant satisfies the ASPR conditions over the range of 
parameter variations. As seen from the results of the tip position response, the system 
closely follows the reference model. During the simulations it was observed that DMRAC 
was capable of controlling the plant despite the changes in the plant parameters. The ease of 
its implementation and its robustness were demonstrated. 
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1. Introduction 

Adaptive control theory has been widely applied for stabilizing linear time invariant plants 
of unknown parameters (Goodwin & Sin, 1984). One of the more used methods for such a 
purpose is based on the model reference adaptive control (MRAC) problem (Astrom & 
Wittenmark, 1997). Such a method requires some assumptions relative to the plant to be 
controlled in order to carry out the synthesis of a stable controller (Narendra & 
Annas wamy, 1989). One of them is that the plant has to be inversely stable, what means that 
its zeros have to be located within the stability domain. However, this information is not 
always available to the designer when the system under control contains unknown 
parameters. There are several alternatives to circumvent this drawback and carry out the 
stable adaptive control design. Some of them consist on relaxing the control performance 
from the model matching to that achievable from the closed-loop pole placement (Alonso- 
Quesada & De la Sen, 2004 and Arvanitis, 1999). In this way, the stabilization of the closed- 
loop system can be ensured although its transient behaviour cannot be fixed to a predefined 
one. 

On one hand, the work (Alonso-Quesada & De la Sen, 2004) includes an estimates 
modification in the estimation algorithm to ensure the controllability of the estimated plant 
model without assuming any knowledge about the parameters of the plant to be controlled. 
This controllability property is crucial to avoid pole-zero cancellations between the 
estimated plant and the controller, which are both time-varying. In this context, a projection 
of the estimated plant parameters into a region in the parameter space where the closed- 
loop system is free of pole-zero cancellations for all time can be alternatively used provided 
that the true plant is controllable and the knowledge of a region where the true plant 
parameters belong to (Goodwin & Mayne, 1987). 

On the other hand, the research (Arvanitis, 1999) proposes an adaptive pole-placement 
control for linear systems using generalized sampled-data hold functions. Following such a 
technique, gain controllers essentially need to be designed. Concretely, a periodic piecewise 
constant gain controller is added in the feedback chain. In the non-adaptive case, such 
constant gain values are those required so that the discretized closed-loop model under a 
fundamental sampling period and a zero-order hold (ZOH) be stabilized. For such a 
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purpose, each sampling period is divided in a certain finite number of uniform subintervals 
and the controller gain takes a different value within each of them in order to locate the 
discretized poles at the stable desired locations. In other words, the controller consists of a 
constant vector of gains. In this sense, the controller works with a sampling rate faster than 
that used to discretize the plant to be controlled. In the adaptive case, an estimated model of 
the discretized plant is on-line updated by means of an estimation algorithm. Such a model 
is used to parameterize the controller gains vector which becomes time-varying and 
converges asymptotically to a constant one. 

Another alternative, which does not relax the MRAC objective, to overcome the 
drawback of the unstable zeros in a continuous-time plant is the design of discrete-time 
controllers which are synthesized from the discretization of the continuous-time plant by 
means of a holder device combined with a multirate with fast input sampling rate (De la Sen 
& Alonso-Quesada, 2007 and Liang & Ishitobi, 2004). The main motivation of this method is 
that an inversely stable discretized model of the plant can be obtained with an appropriate 
choice of the multirate gains. In this way, an adaptive controller can be designed to match a 
discrete-time reference model since all the discretized plant zeros may be cancelled if suited. 

In this context, a fractional-order hold (FROH) with a multirate input is used in this paper to 
obtain an inversely stable discretized plant model from a possible non-inversely stable and unstable 
time invariant continuous-time plant. Then, a control design for matching a discrete-time reference 
model is developed for both non-adaptive and adaptive cases. Note that a FROH includes as 
particular cases the ZOH and the FOH (first-order hold). In this way, the stabilization of the 
continuous-time plant is guaranteed without any assumption about the stability of its zeros 
and without requiring estimates modification in contrast with previous works on the 
subject. In this sense, this paper is an extension of the work (De La Sen & Alonso-Quesada, 
2007) where the same problem is addressed. The main contribution is related to the method used 
to built the continuous-time plant input from the discrete-time controller output. In the present 
paper, the FROH acts on the fundamental sampling period used to discretize the plant 
(plant output sampling) while in the aforementioned paper the FROH acted on the sampling 
period used to define the multirate device at the plant input. This later sampling period is 
an integer fraction of the plant output one, i.e. an integer number of input samples takes 
place within each output sampling period. Such an integer has to be suitably chosen for 
disposing of the enough freedom degrees being necessary to place the discretized plant 
zeros at desired locations, namely within the unity circle in order to guarantee the inverse 
stability of the discretized plant model. 

The assumptions about the plant to guarantee the closed-loop stability of the adaptive 
control system are the following: (1) the stabilizability of the plant and (2) the knowledge of 
the continuous-time plant order. The motivation for using a multirate sampling input 
instead of the most conventional single rate one resides in the fact that the former, with the 
appropriate multirate gains, provides an inversely stable discretized plant model without 
requirements on either the stability of the continuous-time plant zeros or the size of the 
sampling period. In this sense, a single rate input can only provide an inversely stable 
discretized plant from an inversely stable continuous-time plant and, moreover, the 
fundamental sampling period to discretized the plant has to be sufficiently small (Blachuta, 
1999). Finally, the use of a FROH, instead of the most conventional ZOH, allows to 
accommodate better some discrete adaptive techniques to the transient response of discrete- 
time controlled continuous-time plants (Barcena et al., 2000 and Liang et al., 2003). 
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The paper is organized as follows. Section 2 formulates a discrete state-space 
description under fast input sampling to then obtain an input-output discrete transfer 
function for the running slow sampling rate, namely, that acting on the output signal. The 
selection of the scalar gains that generate the fast sampled input so that the discrete plant 
zeros are stable is focused on depending on the continuous-time plant parametrization. 
Section 3 discusses the synthesis of a model-matching based controller with a possible 
potential free design of all the zeros of the reference model. The case of known plant 
parameters and the adaptive case for not fully known plant parameters are both considered. 
Two alternatives are proposed to update on-line the time-varying multirate gains in the 
adaptive case. The first one updates the multirate gains for all sampling instants in order to 
maintain the zeros of the estimated discretized plant fixed within the stability domain. On 
the contrary, the other one updates the multirate gains only when the change of gains is 
crucial to guarantee the stability of the estimated discretized plant zeros. In this way, the 
multirate gains are not updated for all sampling instants and then they became piecewise 
constant. As a result, the zeros of the estimated discretized plant become time-varying 
within the stability domain. Section 4 deals with the stability analysis of the adaptive control 
system. Simulated examples which highlight the proposed design philosophy are provided 
in Section 5. A comparison of the results obtained with the two different methods for 
updating the multirate gains is presented. Finally, conclusions end the paper in Section 6. 

2. Discretized Plant Representation 

Consider a linear time-invariant, single-input single-output and strictly proper continuous- 
time plant described by the following state space equations: 

x(t) = Ax(t) + Bu(t) ; y(t) = Cx(t) (1) 

where u(t) and y(t) are, respectively, the input and output signals, x(t) e 9l n denotes the 
state vector and A, B and C are constant matrices of suitable dimensions. A FROH and a 
multirate sampling on the fast input sampling will be used in order to obtain an inversely 
stable discretized plant model. The signal generated by such a device is given by, 

u(t) = «,{u(k) + p ^>-^-V *r)} P) 

for te[kT + (j-l)T',kT + jr), je{l, 2, ...,N} and FROH correcting gain (3 e [-1,0) u (0,1], 
where T is the sampling period (slow sampling) which is divided in N equal subperiods of 
length T' = %r (fast sampling) to generate the multirate plant input, u(k) denotes the 

value of a discrete-time controller output signal at the instant kT, for all non-negative 
integer k, and ol 's are real constants. Note that the FROH device operates on the sequence 

(u(k)} defined at the slow sampling instants kT and then the input u(t) is generated over 

each subperiod with the corresponding gain a- , for j e {1, 2, ..., N} , via (2). By substituting 

(2) into (1) and sampling the plant output y(t) over the sampling period T, the following 



1 1 6 Adaptive Control 

state space representation is obtained which corresponds to the discrete-time plant model 
that relates the sequences {u(k)| and {y(k)| : 

x(k + l) = Fx(k) + G 1 u(k) + G 2 u(k-l) ; y(k) = Cx(k) (3) 

where F = \|/ N = (|)(T) = e AT is the continuous-time state transition matrix computed for a 
slow sampling period and, 

G 1 =(ia < V N -'](r + ir] e SR»" 1 ; G 2 =-|[£o /V N -'jr e «»" 1 (4) 

with, 

r = f '<|>(T' - s) B ds e <R nxl ; V = f ' ty(T - s) B s ds e 9T xl (5) 

The transfer function of this discrete-time plant model is, 



,N,-ll r , V )_ B ( Z ) 



H(z) = C(zI n -y N )- 1 |G 1+ ^G 2 j = ^ (6) 



where, 



B(z) = C Adj(zl n - OC A (z)g = Det 
A(z) = z Det(zl n - V|/ N ) = z n+1 + X a i zn "' +1 



zI n - ¥ N C A (z)g 
-C 



=Ib iZ — 



(7) 



with Adj() and Det() denoting, respectively, the adjoint matrix and the determinant of the 
square matrix (.), I n denoting the n-th order identity matrix, and 



g = [a 1 -a N ] T e9r +1 ;C 4 (z)"-[y N - 1 A(z) ••• ¥ A(z) A(z) Je9?' 



(8) 



A( Z ) =z|r + |r'l-|r 



Note that the coefficients b { , for i e {1, 2, ..., n + 1}, of the polynomial B(z) in (7) 

depend on the values a. , for j e {1, 2, ..., N} , which define the multirate device. This fact 

lets to allocate the zeros of the discretized plant model at desired locations if a suitable 
number of multirate gains is provided. In this sense, the multirate gains a. , being the 

components of the vector g , are calculated to guarantee that such zeros are maintained 

within the stability domain, i.e., the unity circle. In particular, the coefficients b { can be 
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expressed as: 

N 

b i =£b M a,«v = Mg (9) 

where M = [b^ 1 e 9? (n+1)xN and v = [b a b 2 ... b n+1 ] . The coefficients b it] depend on the 

parameters of the continuous-time plant, the sampling period T and the correcting gain (3 of 
the FROH considered in the discretization process. 

Assumptions 1. 

(i) The plant is stabilizable, i.e. its transfer function does not possess unstable pole-zero 

cancellations, 
(ii) the plant order n is known, and 
(iii) the correcting gain (3 of the FROH device and the sampling period T are chosen such 

that M is a full rank matrix. *** 

Remark 1. The multirate gains vector g required to place the zeros of the discretized plant 
transfer function (6) at desired locations may be calculated from (9) provided Assumptions 1 
and that N > n + 1 . In this sense, such locations are prefixed via a suitable choice of the 
vector v composed by the coefficients of the desired polynomial for the transfer function 
numerator. If N > n + 1 , different solutions can be obtained for g . Otherwise, i.e. if 
N = n + 1 , there is a unique solution for the multirate gains vector from the linear algebraic 
system (9) which places the discretized zeros at desired locations. *** 

The discretized model (6) can be described by the following difference equation: 



y(k) = -Sa 1 y(k-i) + S b 1 u ( k - i ) = -Z a 1 y( k - i ) + ZZ b M^ u ( k - i ) 



i=l j=l 



(10) 



-£a, y(k-i) + X2X u j (k-i) = 8 T cp(k-l) 

1=1 1=1 j=i 



where, 



e = [ej ol A e£ 2 ... < n+1 ] T ; <p(k-i)=[<pj(k-i) qftk-i) <pJ(k-2) ... ^(k-n-i)] T 

8 a =[- ai -a 2 ... -a n f ; <p y (k-l) = [y(k-l) y(k-2) ... y(k-n)] T 

e b<i =[b w b u ... b, N ] T ; cp„(k-i) = [u 1 (k-i) u 2 (k-i) ... u N (k-i)f 
u,(k-i) = a,u(k-i) 



(11) 



for all i e {1, 2, ..., n + 1} and all j e {1, 2, ..., N} . In the rest of the paper, the case 
N = n + 1 will be considered for simplicity purposes. 
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3. Control Design 

The control objective in the case of known plant parameters is that the discretized plant 

model matches a stable discrete-time reference model H (z) = whose zeros can be 

A m (z) 

freely chosen, where z is the Z-transform argument. Such an objective is achievable if the 
discretization process uses the multirate sampling input with the appropriate multirate 
gains, what guarantees the inverse stability of the discretized plant. Then, all the discretized 
plant zeros may be cancelled by controller poles. In this way, the continuous-time plant 
output tracks the reference model output at the sampling instants. The tracking-error 
between such signals is zero at all sampling instants in the case of known plant parameters 
while it is maintained bounded for all time while it converges asymptotically to zero as time 
tends to infinity in the adaptive case considered when the plant parameters are fully or 
partially unknown. A self-tuning regulator scheme is used to meet the control objective in 
both non-adaptive and adaptive cases. 

3.1 Known Plant 

The proposed control law is obtained from the difference equation: 

R(q)u(k) = T(q)c(k)-S(q)y(k) (12) 

for all non-negative integer k, where {c(k)} is the input reference sequence and q is the 

running sample rate advance operator being formally equivalent to the Z-argument used in 
discrete transfer functions. The reconstruction of the continuous-time plant input u(t) is 
made by using (2), with the control sequence {u(k)| obtained from (12), with the 

appropriate multirate gains a., for j e {1, 2, ..., N} , to guarantee the stability of the 

discretized plant zeros. 

The discrete-time transfer function of the closed-loop system obtained from the 
application of the control law (12) to the discretized plant (6) is given by: 

Y(z) _ B(z)T(z) _ T(z) 

C(z) A(z)R(z) + B(z)S(z) A(z) + S(z) 

where the second equality is fulfilled if the control polynomial R(z) = B(z) . In this way, the 
polynomial B(z) , which is stable, is cancelled. Then, the polynomials T(z) , R(z) and S(z) 

of the controller (12) so that — — = are obtained from: 

C(z) A m (z) 

T(z) = B m (z)A s (z) ; R(z) = B(z) ; S(z) = A m (z)A,(z)-A(z) (14) 

where A s (z) is a stable monic polynomial of zero-pole cancellations of the closed-loop 
system. The following degree constraints are satisfied in the synthesis of the controller: 
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Deg[A m (z)] + Deg[A s (z)] = Deg[A(z)] = n + 1 

Deg[S(z)] = Deg[A(z)]-l = n => S(z) = Js i+1 z"" 5 (15) 

i=0 

Deg[T(z)] = Deg[B m (z)] + Deg[A s (z)] < n 

3.2 Unknown Plant 

If the continuous-time plant parameters are unknown then the vector 6 in (11) composed of 
the discretized plant model parameters is also unknown. However, all the above control 
design in the previous subsection remains valid if such a parameter vector is estimated by 
an estimation algorithm. In this way, the controller parameterization can be obtained from 

R(z,k) = B(z,k) , with B(z,k) denoting the estimated of B(z) at the current slow sampling 
instant kT, and equations similar to (14) by replacing the discretized plant polynomial A(z) 
by its corresponding estimated one A(z,k) (Alonso-Quesada & De la Sen, 2004). Note that 
T(z) in (14) has to be calculated once for all since B m (z) and A s (z) are time-invariant while 
S(z) is updated at each running sampling time since the polynomial A(z,k) is time- 
varying. The coefficients of the unknown polynomial B(z) depend, via (9), on the multirate 
input gains c^ , f or j e {1, 2, ..., N} , being applicable to calculate the input within the inter- 
sample slow period. However, the estimation algorithm provides an adaptation of each 
parameter by , namely b { .(k) , for i, j e {1, 2, ..., N} and all non negative integer k. Then, 
the a. -gains have to be also updated in order to ensure the stability of the zeros of the 
estimated discretized plant, i.e. the roots of B(z,k) be stable. Then, the gains a. become 
time-varying, namely a-(k) . The estimation algorithm for updating the parameters vector 

6(k) , which denotes the estimated of 6 , and two different design alternatives for the 
adaptation of the multirate gains are presented below. Also, the main boundedness and 
convergence properties derived from the use of such algorithms are established. 

3.2.1. Estimation algorithm 

An 'a priori 7 estimated parameters vector is obtained at each slow sampling instant by using 
a recursive least-squares algorithm (Goodwin & Sin, 1984) defined by: 

p^ptk-i)-^-^- 1 ^- 1 ^- 1 ) 

v ' l + cp T (k-l)P(k-l)cp(k-l) 

e°(k) = e»(k - 1) + P(k-i)9(k-i)e'(k) (16) 

v ' l + cp T (k-l)P(k-l)cp(k-l) 

for all integer k > where e°(k) = (e - 6° (k - 1)) T cp(k - 1) = 6° T (k - l)<p(k - 1) denotes the 'a 
priori 7 estimation error and P(k) is the covariance matrix initialized as P(0) = P T (0) >0 . 
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Such an algorithm provides an estimation 0°(k) of the parameters vector by using the 
regressor cp(k - 1) , defined in (11), built with the output and input measurements with the 
multirate gains d^k-1) obtained at the previous slow sampling instant, i.e. 
u . (k - i) = d . (k - 1) u(k - i) for all i e {1, 2, . . . , n+1} . Then, an 'a posteriori 7 estimates vector 
is obtained in the following way: 

Modification algorithm. 

This algorithm consists of three steps: 

Step 1 : Built the matrix M°(k) = [b? j (k)l e9? NxN , for i, je{l, 2, ..., N} , from the 'a priori' 

estimates 0° ^(k) , included in 6° (k) , of the corresponding Q hi defined in (11). 

Step 2 : M(k) = M°(k) 

J/Det[M(k)]>5 thenQ hA (k) = Q° hA (k) 

else while DetrM(k)l<5 

M(k) = M(k) + 5I N 
end; 
for i = l to N 

e w (k) = M i (k) 

end- 
end. 

Ste E 3:e(k) = pf(k) Ql A (k) 9* 2 (k) ... eJ, N (k)]\ 

for some real positive constants 5 « 1 and 8 « 1 , and where M^k) denotes the z'-th row 
of M(k) . 

Remark 2. Note that the estimate 0°(k) corresponding to the parameters of 9 a is not 
affected by the modification algorithm. Also, note that the while instruction part of the 
second step is doing a finite number of times since there exists a finite integer number £ 
such that Det[M(k)] = Det[M (k) + ^I N ] = (^) N +f(5, 9^(k), 9° 2 (k), ..., eJ, N (k))>8 . 

3.2.2. Updating of the time-varying multirate gains 

Once the estimated parameters vector is obtained at each slow sampling instant the 
multirate input gains have to be updated. Two alternative algorithms are considered to 
carry out such an operation. 

Algorithm 1. 

A vector of multirate gains is updated at all slow sampling instants in order to maintain the 
zeros of the estimated discretized plant fixed at desired locations within the stability domain 
|z| < 1 . Such desired zeros are the roots of a predefined polynomial B'(z) . For such a 
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purpose, the required vector g(k) is obtained from the resolution of the following matrix 
equation: 

M(k)g(k) = v (17) 

at each slow sampling instant, where v = [b^ b' 2 ... b' N ] is composed by the coefficients 
of B'(z) , M(k)= ^.(k) e$R NxN , with b { .(k) denoting each of the 'a posteriori 7 estimated 
parameters corresponding to the components of the vectors Q hi defined in (11), and 

g(k) = [d 1 (k) d 2 (k) ... d N (k)] . In this way, g(k) is composed by the multirate gains 

which make the numerator of the estimated discretized plant model be equal to the desired 
polynomial B'(z) . Note that the matrix equation (17) can be solved at all slow sampling 

instants since the parameters modification added to the estimation algorithm ensures the 
non-singularity of the matrix M(k) . 

Algorithm 2. 

It consists of solving the equation (17) only when it is necessary to modify the previous 
values of the multirate gains in order to guarantee the stability of the zeros of the estimated 
discretized plant model, i.e., the multirate gains remain equal to those of the preceding slow 
sampling instant if the zeros of the estimated discretized plant obtained with the current 
estimated parameters vector, 6(k) , and the previous multirate gains, d^k-1), are within 

the discrete-time stability domain. Otherwise, the multirate gains are updated by the 
resolution of the equation (17), which can be solved whenever it is necessary since the 
matrix M(k) is invertible at all slow sampling instant due to the modification included in 
the estimation algorithm. In this way, the multirate gains are piecewise constant, the 
estimated discretized plant zeros are time-varying and the computational burden associated 
with the updating of the multirate gains is reduced with respect to that of Algorithm 1. 

3.2.3. Properties of the estimated models 

The parameter estimation algorithm, together with any of the considered adaptation 
algorithms for the multirate gains, possesses the properties given in the following lemma, 
whose proof is presented in Appendix A. 

Lemma 1. Main properties of the estimation and multirate gains adaptation algorithms 

(i) P(k) is uniformly bounded for all non-negative integer k, and it asymptotically 

converges to a finite, at least semidefinite positive, limit as k — » go . 
(ii) 0°(k) and 6(k) are uniformly bounded and they asymptotically converge to a finite 

limit as k — » go . 
(iii) The vector g(k) of multirate gains is bounded and converges to a finite limit as k — » go . 

(e°(k)) 2 

(iv) is uniformly bounded and it asymptotically converges to 

1 + cp (k - 1) P(k - 1) cp(k - 1) 
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zero as k — » oo . 
(v) e°(k) asymptotically converges to zero as k — » oo . 
(vi) Assuming that the external input c(k) is sufficiently rich such that cp(k - 1) in (11) is 

persistently exciting, 0°(k) tends to the true parameters vector 6 as k — » go . Then, 6(k) 

tends to 9°(k) and e(k) = (d- 6(k - 1)) (p(k - 1) tends to zero as k -> oo . *** 

Remark 3. The convergence of the estimated parameters to their true values in 6 requires 
that cp(k - 1) is persistently exciting. In this context, cp(k - 1) is persistently exciting if there 

k +£ 

exists an integer £ such that p 1 I m > ^ (p(k-l)(p T (k-l) > p 2 I m where p a >0, p 2 >0 and 

k=k 

m = n + N 2 = n 2 + 3n + 1 is the number of components of the regressor cp(k - 1) . Such a 
condition may be ensured by chosing an external input sufficiently rich of order m , i.e. it 
consists of at least m / frequencies in the frequency domain (Ioannou & Sun, 1996). *** 

4. Stability Analysis 

The plant discretized model can be written as follows, 

y(k) = y(k) + e(k) = 6 T (k - l)<p(k - 1) + e(k) = -£a,(k - l)y(k - i) + ^(k - l)u(k - i) + e(k) (18) 

i=l i=l 

and the adaptive control law as, 

u(k) = ^{Z(Mk-l) ai (k-^ 

b x (k)Ui i=i v } q 9 j 

-s 1 (k-l)b n+1 (k-l)u(k-n-l) + Xb mi c(k-i + l)}-^e(k) + 8(k) 

i=i J b x (k) 

where (12) has been used with R(q) and S(q) substituted, respectively, by time-varying 

polynomials R(z,k) = B(z,k) and S(z,k), which is the solution of the equation (14) for the 
adaptive case, and, 

-X[(s 1 (k)-s 1 (k-l))b i (k-l) + (b i+1 (k)-b 1+1 (k-l))]u(k-i) (20) 

-(S 1 (k)-S 1 (k-l))b n+1 (k-l)u(k-n-l)} 

By combining (18) and (19), the discrete-time closed-loop system can be written as: 
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x(k) = A(k - 1) x(k - 1) + ¥ a e(k) + ¥ 2 3(k) 



(21) 



where 0(k) = 



b a (k) 



Xb mi c(k-i + l)-s 1 (k)e(k) +5(k)and, 



x(k-l)=[y(k-l)y(k-2> • -y(k-n)u(k-l)u(k-2> • -u(k-n-l j 



^[lO-Ofe^ 21 ^ 1 ;^ 



00-010-0 



e9f r 



A(k-1)= 



-a,(k-l) -a 2 (k-l) 

1 

1 



f^k-1) f,(k-l) 



b,(k) 




b,(k) 




-a^Ck-1) -a n (k-l) b^k-l) b 2 (k-l) 


: 

o ; ; i 

i ooo 

f_,(k-l) f(k-l) h,(k-l) ^(k-l) 



b,(k) 




¥k) 




b,(k) 
1 



b,(k) 




b n (k-l) b n+1 (k-l) 






h(k-l) h n+1 (k-l) 

¥k) 





b,(k) 




(22) 



with f i (k-l) = s 1 (k-l)a i (k-l)-s i+1 (k-l), h i (k-l) = -(s i (k-l)b i (k-l) + b i+1 (k-l)), for 
ie{l, 2, ...,n},and h n+1 (k-l) = -s a (k-l)b n+1 (k-l) . 

N A 

Note that a\(k-l) and b^k-l) = ^ J b i j (k-l)d j (k-l) are uniformly bounded from 

j=i 

Lemma 1 (properties ii and iii). Also, b^k) ^0 since the adaptation of the multirate gains 
makes such a parameter fixed to a prefixed one which is suitably chosen and s t (k-l) is 
uniformly bounded from the resolution of a equation being similar to that of (14) replacing 
polynomials A(z) and S(z) by time-varying polynomials A(z,k-1) and S(z,k-1), 

respectively. 

The following theorem, whose proof is presented in Appendix B, establishes the main 
stability result of the adaptive control system. 

Theorem 1. Main stability result. 

(i) The adaptive control law stabilizes the discrete-time plant model (6) in the sense that 
{u(k)| and {y(k)| are bounded for all finite initial states and any uniformly bounded 

reference input sequence {c(k)| subject to Assumptions 1, 

(ii) {y(k)| converges to {y m (k)| as k tends to infinity, and 

(iii) the continuous plant input and output signals, u(t) and y(t) , are bounded for all t. *** 
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5. Simulations Results 

Some simulation results which illustrate the effectiveness of the proposed method are 

shown in the current section. A continuous-time unstable plant of transfer function 

s-2 

G(s) = with an unstable zero, and whose internal representation is defined by 

(s-l)(s + 3) 



the matrices A = 



-3 0" 
1 



, B = [l 1] T and C = [l.25 -0.25], is considered. A suitable 



multirate scheme with fast input sampling through a FROH device is used to place the zeros 
of the discretized plant within the stability region and a discrete-time controller is 
synthesized so that the discrete-time closed-loop system matches a reference model. The 
results for the case of known plant parameters are presented in a first example and then two 
more examples with the described adaptive control strategies are considered. The difference 
among such adaptive control strategies relies on the way of updating the multirate gains for 
ensuring the stability of the estimated discretized plant zeros. 

5.1. Known Plant Parameters 

The discretization of the continuous-time plant with a multirate, N = 3 , and a FROH device 
with (3 = 0.7 for a slow sampling time T = 0.3 is performed leading to the discrete transfer 

function H(z) = ^ = bl( f )z + h ^ z + h ^ where b 1 (g) = 0.0307a 1 +0.0693a 2 +0.13a 3/ 
v J A(z) z(z 2 -1.7564z + 0.5488) 1V6/ a 2 3 

b 2 (g) = -(0.0788a, +0.1488a 2 +0.2631a 3 ) and b 3 (g) = 0.0083a a +0.0343a 2 +0.0797a 3 are the 

coefficients of the transfer function numerator of the discretized model. Such coefficients 

depend on the multirate gains a { , for i e {1, 2, 3} , included as components in the vector g . 

The zeros of such a discretized plant can be fixed within the stability domain via a suitable 

choice of the multirate gains. In this example such gains are a t = -621.8706 , oc 2 = 848.4241 

and ot 3 = -297.4867 so that B(z) = B'(z) = z 2 + z + 0.25 and then both zeros are placed at 

z = -0.5 . The control objective is the matching of the reference model defined by the 

z 2 +z -0.272 

transfer function G m (z) = '— — . For such a purpose, the controller has to cancel the 

mV ' (z + 0.2) 3 F F 

discretized plant zeros, which are stable, and add those of the reference model to the 
discrete-time closed-loop system. The values of the control parameters to meet such an 
objective are s 1 = 2.3564 , s 2 = -0.4288 and s 3 = 0.008 . A unitary step is considered as 

external input signal. Figure 1 displays the time evolution of the closed-loop system output, 
its values at the slow sampling instants and the sequence of the discrete-time reference 
model output. Figure 2 shows the plant input signal. Note that perfect model matching is 
achieved, at the slow sampling instants, without any constraints in the choice of the zeros of 
the reference model G m (z) , in spite of the continuous-time plant possesses an unstable zero. 
Furthermore, the continuous-time output and input signals are maintained bounded for all 
time. 
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5.2. Unknown Plant Parameters 

An adaptive version of the discrete-time controller designed in the previous example is 
considered with the parameters estimation algorithm being initialized with 

9°(0) = 10" 2 x [263.46 -82.32 4.61 10.39 19.51 -11.82 -22.33 -39.46 1.25 5.15 11.95] T 
and P(0) = 1000 • I n . Furthermore, the values 5 = 5 = 10~ 6 are chosen for the modification 
algorithm included in such an estimation process. Two different methods are considered to 
update the multirate gains. The first one consists of updating such gains at all the slow 
sampling instants so that the discretized zeros are maintained constant within the stability 
domain (Algorithm 1). The second one consists of changing the value of the multirate gains 
only when at least one of the discretized zeros, which are time-varying, is going out of the 
stability domain. Otherwise, the values for the multirate gains are maintained equal to those 
of the previous slow sampling instant (Algorithm 2). 

5.2.1. Algorithm 1: Discretized plant zeros are maintained constant 

Figure 3 displays the time evolution of the closed-loop adaptive control system output, its 
values at the slow sampling instants and the sequence of the discrete-time reference model 
output under a unitary step as external input signal. Note that the discrete-time model 
matching is reached after a transient time interval. Figures 4 and 5 show, respectively, the 
plant output signal and the input signal generated from the multirate with the FROH 
applied to the control sequence {u(k)| . It can be observed that both signals are bounded for 

all time. Finally, Figures 6 and 7 display, respectively, the time evolution of the multirate 
gains and the adaptive controller parameters. Note that the multirate gains and the adaptive 
control parameters are time-varying until they converge to constant values. 
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Fig. 3. Plant and reference model output signals 



Discrete Model Matching Adaptive Control for Potentially Inversely Non-Stable Continuous-Time 127 
Plants by Using Multirate Sampling 



x 10 




7 8 9 10 



Fig. 4. Plant output signal 



x10 



u(t) 



1.5 

1 

0.5 


-0.5 

-1 
-1.5 

-2 



I I I 


1 III II- 


J 


J / 


— -^ujllfl 


|fc>~ — : 




N ' 


\ 


\ 


1 1 1 


■ i i i i i 



1 



10 



Fig. 5. Plant input signal 



128 



Adaptive Control 



1200 
1000 

800 

600 

400 

200 


-200 K 

i i 

-400 

-600 

-800 

1 

Fig. 6. Multirate gains 




9(1) 

-- g(2) 

- - ■ g(3) 



10 



3.5 

3 
2.5 

2 
1.5 

1 
0.5 

- 

-0.5 

-1 




10 



Fig. 7. Adaptive control parameters 
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5.2.2. Algorithm 2: Discretized plant zeros are time-varying 

The multirate gains are maintained constant to their values at the previous slow sampling 
instant until at least one of the discretized plant zeros is going out of the stability domain. In 
this sense, note that the discretized zeros vary when the values of the multirate gains are 
maintained constant and eventually they can go out of the stability domain. When this 
happens such gains are again calculated to place both discretized zeros at z = -0.5 . The 

discrete-time model matching is reached after a transient time interval and the continuous- 
time plant output and input signals are bounded for all time as it can be observed from 
Figures 8, 9 and 10 where the response to a unitary step is shown. The maximum values 
reached by both continuous-time output and input signals are larger than those obtained 
with the previous method (Algorithm 1) for updating the multirate gains. Figures 11 and 12 
display, respectively, the evolution of the multirate gains and the controller parameters. The 
adaptive control parameters are time-varying until they converge to constant values while 
the multirate gains are piecewise constant and also they converge to constant values. Note 
that this second method ensures a small number of changes in the values of the multirate 
gains compared with the first method since such gains only vary when it is necessary to 
maintain the zeros within the stability domain. This fact gives place to a less computational 
effort to generate the control law than that required with the first method. However, the 
behaviour of the continuous-time plant output and input signals is worse with the use of 
this second alternative in this particular example. Finally, the evolution of the modules of 
the discretized plant zeros and the coefficients of the time-varying numerator of such an 
estimated model are, respectively, shown in Figures 13 and 14. 
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6. Conclusion 

This paper deals with the stabilization of an unstable and possibly non-inversely stable 
continuous-time plant. The mechanism used to fulfill the stabilization objective consists of 
two steps. The first one is the discretization of the continuous-time plant by using a FROH 
device combined with a multirate input in order to obtain an inversely stable discretized 
model of the plant. Then, a discrete-time controller is designed to match a discrete-time 
reference model by such a discretized plant. There is not any restriction in the choice of the 
reference model since the zeros of the discretized plant model are guaranteed to be stable by 
the fast sampled input generated by the multirate sampling device. 

An adaptive version of such a controller constitutes the main contribution of the 
present manuscript. The model matching between the discretized plant and the discrete- 
time reference model is asymptotically reached in the adaptive case of unknown plant. Also, 
the boundedness of the continuous-time plant input and output signals are ensured, as it is 
illustrated by means of some simulation examples. In this context, the behaviour of the 
designed adaptive control system in the inter-samples period may be improved. In this 
sense, an improvement in such a behaviour has been already reached with a multi- 
estimation scheme where several discretization/ estimation processes, each one with its 
proper FROH and multirate device, are working in parallel providing different discretized 
plant estimated models (Alonso-Quesada & De la Sen, 2007). Such a scheme is completed 
with a supervisory system which activates one of the discretization/ estimation processes. 
Such a process optimizes a performance index related with the inter-sample behaviour. In 
this sense, each of the discretization/ estimation processes gives a measure of its quality by 
means of such an index which may measure the size of the tracking-error and/ or the size of 
the plant input for the inter-sample period. The supervisor switches on-line from the current 
process to a new one when the last is better than the former, i.e. the performance index of 
the new process is smaller than that of the current one. Moreover, the supervisor has to 
guarantee a minimum residence time between two consecutive switches in order to ensure 
the stability of the adaptive control system. 
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7. Appendix A. Proof of Lemma 1 

(i) P(k) is a monotonic non-increasing matrix sequence since P(k) - P(k - 1) < for all 
integer k>0 from (16). Moreover, if P(k 1 ) = for any integer k^O then 
P(k 1 +1)-P(k 1 ) = from (16) and then P(k) = for all integer k>k a . Thus, 

< P(k) < P(0) and P(k) asymptotically converges to a finite limit as k — » oo . 

(ii) By considering the non-negative sequence V(k) = 9° (k)P _1 (k)9°(k) and applying the 
matrix inversion lemma (Goodwin & Sin, 1984) to (16) it follows that, 

(e°(k)) 2 

V(k) - V(k - 1) = = ^— -^ < (23) 

v ' l + cp T (k-l)P(k-l)cp(k-l) v ' 

where (16) and the definition of the estimation error have been used. Then, V(k) < V(0) 
and |e (k)|<^j£M|e°(0)|<» where X max {P(0)} and X min {P(0)} denote the 

maximum and the minimum eigenvalues of P(0) , respectively. It implies that 9°(k), 
and then also 9°(k), is uniformly bounded. Then, 9(k) is also bounded since the 
modification algorithm guarantees the boundedness of M(k) provided that 9°(k) is 
bounded. On other hand, V(k) asymptotically converges to a finite limit as k — » oo from 
its definition and the fact that such a sequence is non-negative and monotonic non- 
increasing. Then, 9°(k) , and also 9°(k) , converges to a finite limit as k — » oo since P(k) 
also converges as it has been proved in (i). Then, M(k) and 9(k) also converge to finite 
limits as k — » oo . 
(iii) The boundedness and convergence of the estimation model parameters vector together 
with the non-singularity of the matrix M(k) , guaranteed by the modification algorithm, 
implies the boundedness and convergence of the vector g(k) obtained by resolution of 
equation (17). 

-l)P(i-l)cp(i 1) 

(e°(k))" 

1 + (p T (k-l)P(k-l)(p(k-l) 
(v) It follows that lim{e°(k)} =0 irrespective of the boundedness of cp(k-l) from the fact 

\ (e°(k)) 2 ] 

that linii ^ = 0. On one hand, if cp(k-l) is bounded then 

k— l + (p T (k-l)P(k-l)(p(k-l) v J 



iv) It follows that V ^-^^ = V(0) - V(k) < V(0) < oo from (23), then 

til + 9 (i- " x " /! ^~' 1 " x 



\2 

is uniformly bounded and it converges to zero as k -^ oo 
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linJ = LAJz L = o implies directly that lim(e°(k)} = . On the other 

k— l + (p T (k-l)P(k-l)(p(k-l) r y k^l v n 

\ ( e °( k )) 2 I 

hand, if cp(k-l) is unbounded then linii > = implies that 

v } k->»ll + (p T (k-l)P(k-l)(p(k-l)[ 

[|e°(k-i)| 2 ] _ T , M _ l|2 



= since e u (k) = 9 u (k-l)cp(k-l) and then lim |9 u (k-l)| =0 from 

the fact that P(k) is uniformly bounded. Thus, lim{e°(k)| = . 
(vi) Provided that the external input sequence {c(k)| is sufficiently rich such that cp(k - 1) is 
persistently exciting, 0°(k) tends to the true parameters vector 6 as k^>oo (Goodwin & 
Sin, 1984). Then, M(k) tends to M°(k) from the modification algorithm and, 
consequently, 6(k) tends to 0°(k) and e(k) tends to zero as k -^ qo . *** 

8. Appendix B. Proof of Theorem 1 

(i) A(k) is bounded since the estimation model parameters a t (k) and bj(k), and the 
controller parameters s.(k), for i e {1, 2, ..., n} and j e {1, 2, ..., n + 1}, are bounded 
thanks to the estimated parameters vector 6(k) and the multirate gains vector g(k) are 
bounded for all integer k > . The eigenvalues of A(k) are in |z| < 1 since they are the 
roots of A m (z) and A s (z), due to the designed control law, and the roots of B(z,k) 

which are within the unit circle due to the suitable adaptation of the multirate gains. 
Besides, 

£ ||A(k')-A(k'-l)| 2 <y 0+yi (k-k ) (24) 

k'=k +l 

for all integers k and k such that k > k > , and some sufficiently small positive real 
constants y and y 1 (Bilbao-Guillerna et al., 2005). Note that (24) is fulfilled with a slow 
enough estimation rate via a suitable choice of P(0) in (16) so that y 1 is sufficiently 
small. Thus, the time-varying homogeneous system x(k) = A(k-l) x(k-l) is 



k-k' 



exponentially stable and its transition matrix c|)(k,k') = Y\ A(j) satisfies |(|)(k,k')| < p 1 a ( 

j=k' 

for all k >k' where a e (0,1) and p a is a non-dependent constant (Alonso-Quesada & 

De la Sen, 2004). It follows from (21) that: 
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x(k) = <|>(k,k ) x(k )+ J ^^(^(k'l + ^k')) (25) 

k-=k 

for all integer k > k > . Then, 

|x(k)| = p 1 oj- k -|x(k )| + t Pl a5- k '(p 2+ p 3 |e(k')| + p 4 |5(k')|) (26) 

k-=k 

for some positive real constants p 2 , p 3 and p 4 , provided that the input reference 
sequence {c(k)} is bounded. It follows that lim|a i (k)-a i (k-l)| = and 

limlb j (k)-b j (k-l)| = for all i e{l, 2, ..., n} and j e{l, 2, ..., n + 1} from the 
convergence property of the estimation algorithm. Then, lim|s i (k)-s i (k-l)| = as it 
follows from the adaptive control resolution. Consequently, lim|5(k)| = 0. Besides, 
lim|e(k)| = from the estimation algorithm. Then, x(k) is bounded from (26), which 
implies that sequences {u(k)} and {y(k)| are also bounded, 
(ii) On one hand, the adaptive control law ensures that the estimated sequence {y(k)} 
matches the reference model one {y m (k)| for all integer k >0 . On the other hand, the 
estimation algorithm guarantees the asymptotic convergence of the estimation error 
e(k) to zero. Then, the output sequence {y(k)} tends to {y m (k)} asymptotically as 

k — » oo . 
(iii) The adaptive control algorithm ensures that there is no finite escapes. Then, the 
boundedness of the sequences {u(k)| and {y(k)| implies that the plant input and 

output continuous-time signals u(t) and y(t) are bounded for all t. 
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1. Introduction 

The purpose of this chapter is to redesign the standard adaptive control schemes by using 
hybrid structure composed by Model Reference Adaptive Control (MRAC) or Adaptive Pole 
Placement Control (APPC) strategies, associated to Variable Structure (VS) schemes for 
achieving non-standard robust adaptive control strategies. The both control strategies is 
now on named VS-MRAC and VS-APPC. We start with the theoretical base of standard 
control strategies APPC and MRAC, discussing their structures, as how their parameters are 
identified by adaptive observers and their robustness properties for guaranteeing their 
stability. After that, we introduce the sliding mode control (variable structure) in each 
control scheme for simplifying their design procedure. These design procedure are based on 
stability analysis of each hybrid robust control scheme. With the definition of both hybrid 
control strategies, it is analyzed their behavior when controlling system plants with 
unmodeled disturbances and parameter variation. It is established how the adaptive laws 
compensates these unmodeled dynamics. Furthermore, by using simple systems examples it 
is realized a comparison study between the hybrid structures VS-APPC and VSMRAC and 
the standard schemes APPC and MRAC. As the hybrid structures use switching laws due to 
the sliding mode scheme, the effect of chattering is analyzed on the implementation and 
consequently effects on the digital control hardware where sampling times are limiting 
factor. For reducing these drawbacks it is also discussed possibilities which kind of 
modifications can employ. Finally, some practical considerations are discussed on an 
implementation on motor drive systems. 

2. Variable Structure Model Reference Adaptive Controller (VS-MRAC) 

The VS-MRAC was originally proposed in (Hsu et al., 1989) and extensively discussed in 
(Hsu et al., 1994). The main features of this control scheme are the robustness of parameters 
uncertainties and unmodeled disturbances, as well as good transitory response. 
Consider the following first order plant 

Wis) = V - = -^- , (1) 
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where b and a are unknown or known with limited uncertainties. Admitting a reference 



model given by 



M{s) = ^ = bm , (2) 



in which k m > and a m > , the following output error variable can be defined as 

e o = y - y m • ( 3 ) 

The control objective is to force y(t) to asymptotically track the reference output signal, 
y m (t) , by regulating e Q to be zero, while keeping all the closed-loop signals uniformly 
bounded. The control law used for accomplished this is 

u = 1 y + 2 r , (4) 

which is the same as used in traditional model reference adaptive control. However, instead 
of the integral adaptive laws for the controller parameters, switching laws are proposed in 
order to improve the system transient performance and its robustness. 

If b and a are known, the ideal controller parameters ( 6 1 and 2 ) can be founded using 

the following condition 

y - = hL / (5) 

r r 

which means that our control objective is achieved, i.e., the closed-loop system behaves like 
the open-loop reference model. Consequently, the control law equation can be rewritten as 

u = 0*y + 0*r • (6) 

Analyzing (1) and (2) in the time domain, we get 

V = ~ a p V + k p u > ( 7 ) 

Vm =- a my m +V- ( 8 ) 

Adding and subtracting terms related to the ideal control parameters in (4), 

u = 6 x y + 9 2 r - 6*y - 9 2 r + 6*y + 9 2 r , (9) 
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and then grouping some terms 

« = ft - K )y + (0 2 - 02 ) r + Kv + € r > (io) 

we have 

u = 6^ + 2 r + 0*y + 0* r , (11) 



in which terms 1 and 2 are deviations of ideal controller parameters 6 1 and 2 . 
Substituting the resulting equation (11) in (7), 



„* „* 



y = -%y + b p {e lV + e 2 r + e lV + e 2 r) , (12) 

we can rewrite this equation as 

y = -a p y + b/ x y + b/ 2 r + b^y + 9 2 r) , (13) 

which results in 

V = -{% - b p el)y + b/ 2 r + b p (6 iy + 9 2 r) . (14) 

From (6), the model input r can be defined as 

u — 0-, y 

Therefore, using (11) and (15) in (8), we get 

V m =-« m V + * m r + b iJt(fo + 02r)' ( 16 ) 

°2 

Finally, comparing (14) and (16) due to the condition (5), we have the desired controller 
parameters 

e * %Z^L > (17) 
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K = b ~r- ( 18 ) 

The above desired controller parameters assure that plant output converges to its reference 

model, because b and a are known. This design criteria is named as The Matching 

Conditions. 

However, our interests are concerned with unknown plant parameters or with known plant 
parameters with uncertainties, which require the use of adaptive laws for adjusting 
controller parameters. Derivating the output error equation given in (3), 

% = y-y m ( 19 ) 

and using the condition (5), with equations (8), (16) and (19), we get 

- a mV + K r + -t(0i2> + fa) - (- a m y m + b m r) , (20) 



e, 



o 



w 2 



which can be rearranged as 



% = -* m (y-y m ) + h(hy + hr). (21) 



Thus, 



e — a m e 



^(Kv + 9 2 r). (22) 



Now, consider the Lyapunov function candidate given by 

V(e ) = ^e%>0, (23) 

and its respective first time derivative 

V(e ) = e e . (24) 

By substituting (22) in (24), we obtain the following equation 
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n%) 



"Vo+tH^ + M 

6 2 



^ / 



(25) 



that can be rewritten as 



He ) 



vl 



0*)e y + (6 2 -e*)e r] 



(26) 



Using the switching laws, 



6 1 = -O^gn^y) , 
°2 = -0 2 sgn(e Q r) f 



(27) 
(28) 



we obtain, 



d 2 



% |e r | +9 2 e r)\. 



(29) 



If the conditions 6 1 >\ 1 \ and 6 2 >\ 6 2 \ are satisfied, the terms with indefinite signals 
in (29) are dominated, and then 



V(e ) < -a m el < 



(30) 



which guarantees that e Q = is a globally asymptotically stable (GAS) equilibrium point, 
because (30) is a negative definite function. 

3. Variable Structure Adaptive Pole Placement Control (VS-APPC) 

As the VS-MRAC, the VS-APPC is the hybrid control structure obtained from the 
association of Pole Placement Control (PPC) together with Variable Structure (VS). 
Therefore, the theoretical development of this section starts from PPC control scheme and 
then we introduce the VS concepts for achieving the proposed VS-APPC. 
Considering the single input/ single output (SISO) LTI plant 



y = G(s)u, 



(31) 



in which 
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G(s) = M = K-^- 1 +- + b l . + b (32) 

R(s) s n + a n _ 1 5 n " 1 + ... + a x s + a 

there are, as plant parameters, 2n elements, which are the coefficients of the numerator and 
denominator of transfer function G(s) . We can define the vector 6 as 

T 



\k-l • • • b l b a n-l • • • a i %] • ( 33 ) 



From this, the following constraints must be observed: 

51. R(s) is a monic polynomial whose degree n is known. 

52. Z(s) , R(s) are coprime and degree (Z) < n . 

Assumptions (SI) and (S2) allow Z(s) , R(s) to be non-Hurwitz in contrast to the MRC 

(Model Reference Control) case, where Z(s) is required to be Hurwitz. 

We can also extend the PPC scheme for including the tracking objective, where output y is 

required to follow a certain class of reference signals r , by using the internal model 

principle (Ioannou & Sun, 1996). The uniformly bounded reference signal is assumed to 

satisfy 

QJs)r = 0, (34) 

where Q (s) , the internal model of r , is a known monic polynomial of degree q with non- 
repeated roots on the;&>-axis and satisfies 

53. Q (s) , Z(s) are coprime. 
Considering the control law given by 

Qm(s)L(s)u = -P(s)y + M(s)r , (35) 

where P(s) , M(s) , L(s) are polynomials (with L(s) monic) of degree q + n - 1 , 
q + n - 1 e n - 1, respectively, and Q m (s) satisfies (34) and assumption (S3). 
Applying (35) to the plant (31), we obtain the closed-loop plant equation 

Z(s)M(s) , (36) 



Qm(s)L(s)R(s) + P(s)Z(s) 
whose characteristic equation is 
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Qm(s)L(s)R(s) + P(s)Z(s) = 0, (37) 

and has order 2n + q — 1 . The objective now is chosen P(s) , L(s) such that 

Qm(s)L(s)R(s) + P(s)Z(s) = A*(s) (38) 

is satisfied for a given monic Hurwitz polynomial A (s) of degree 2n + q — 1 . Because of 

assumptions S2 e S3 which guarantee that Q m (s) , i?(s) , Z(s) are coprime, there is a 

solution so that L(s) and P(s) satisfy (38) and this solution is unique (Ioannou & Sun, 

1996). 

Using (38), the closed-loop is described by 

Z(s)M(s) 

y= L; r - ( 39 ) 

A (s) 
Similarly, from the plant (31) and the control law (35) and (38), we obtain 



R(s)M(s) 
u= v '- w r . 40 

A\s) 

■c . i , , , Z(s)M(s) R(s)M(s) ., „ 

Because r is uniformly bounded and , are proper with stable poles, 

A(s) A(s) 

y and u remain bounded whenever t — > oc for any polynomial M(s) of degree 
n -\- q — 1 (Ioannou & Sun, 1996). Therefore, the pole placement objective is achieved by 
the control law (35) without having any additional restrictions in M(s) and Q m (s) . When 
r = , (39) and (40) imply that y and u converge to zero exponentially fast. On the other 
hand, when r ^± , the tracking error e = y — r is given by 

Z(s)M(s) - A*(s) Z(s) r „, . „, ,, L(s)R(s) „ , N 
e = w w K - 1 r = —^^[Mts) - P(s)]r w — Qm(s)r . 41 

A(s) A(s) x A(s) 

In order to obtain zero tracking error, the equation above suggests the choice of 
M(s) = P(s) to cancel its first term, while the second term can be canceled by using (34). 
Therefore, the pole placement and tracking objective are achieved by using the control law 

Qm(s)L(s)u = -P(s) (y-r), (42) 
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which is implemented as shown in Fig. 1 using n + q - 1 integrators for the controller 
realization. An alternative realization of (42) is obtained by rewriting it as 



A-LQm P, N 

a u-—(y-r), 

A A 



(43) 



where A is any monic Hurwitz polynomial of degree n + q - 1 . 



r + 



o- 



n- 



*W 



&.WW 



GW 



Fig. 1. Block diagram of pole placement control. 

The PPC design supposes that the plant parameters are known, what not always is true or 
possible. Therefore, integral adaptive laws can be proposed for estimating these parameters 
and then used with PPC schemes. This new strategy is called Adaptive Pole Placement 
Controller (APPC), where the certainty equivalence principle guarantees that the output plant 
tracks the reference signal r , if the estimates converge to the desired values. In this section, 
instead of these traditional adaptive laws, switching laws will be used for the the first order 
plant case, according to (Silva et ah, 2004). 
Consider the plant, 



y 



s + a 



-u , 



and its respective time domain equation, 

y = -ay + bu , 



(44) 



(45) 



where the parameters a and b are unknown or known with uncertainties. Let be a m a 
positive constant, we may write (45) by adding and subtracting the term a m y , 



V = ~ a mV + ( a m - °)y + bu • 
A model for the plant may be written as 

y = ~ a m y + ( a m - d )y + ^ * 



(46) 



(47) 
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where a and b are estimates for a and b , respectively (Ioannou & Sun, 1996). 
We define the estimation error e Q as 



e =y-y, (48) 

and with (46) and (47), we get 

^o = ~ a m e o + ay-bu, (49) 

where 

a = a — a , (50) 

6=6—6. (51) 

Choosing the following Lyapunov function candidate, 

^( e o) = T^o 2 >°- (52) 

we have 

V(e ) = e e , (53) 

which can be rewritten using (49), 

V(e ) = -a m e 2 + ae y - be u . (54) 

Expanding the above equation with (50) and (51), 

V(e ) = -a m e 2 + (a - a)e y - (b - b)e Q u , (55) 

and then using the switching laws, 

a = -asga(e Q y), (56) 

b = b sgn(e ^), (57) 

we get, 
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He Q ) 



(a\e Q y\ + ae y) - (b \e Q u\ - be Q u) . 



(58) 



Finally, if the conditions a > \a\ and b > \b\ are satisfied, 



V(e ) < -a m e 2 < 0, 



(59) 



which guarantees that e Q = is a globally asymptotic stable (GAS) equilibrium point. 
Moreover, if we follow a similar procedure described in (Hsu & Costa, 1989), we can prove 
that e Q = reaches the sliding surface in a finite time t* (e Q = , Vt > t*). 

4. Application on a Current Control Loop of an Induction Machine 

To evaluate the performance of both proposed hybrid adaptive schemes, we use an 
induction machine voltage x current model as an experimental plant. The voltage equations 
of the induction machine on arbitrary reference frame can be presented by the following 
equations: 



u sd 



sq 



r. 


+ 


h 


-^| 












T r ) 


K 


+ 


K 


-*,) 












T r J 



H, + <Tl. 



"M 



it 
di 9 



dt 



g s sd 



Us 


-*s) 
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L ) 


Us 


-°ls] 


^ 


L J 



u r tf 



rd 



, (60) 
/ (61) 



where v g sd , v g , i g sd and i g are dq axis stator voltages and currents in a generic reference 

frame, respectively; r s , I and I are the stator resistance, stator inductance and mutual 

inductance, respectively; uo and uo r are the angular frequencies of the dq generic reference 

frame and rotor reference frame, respectively; a = 1 — I / II and r = I / r r are the 
leakage factor and rotor time constant, respectively. 

The above model can be simplified by choosing the stator reference frame [uo = 0). 
Therefore, equations (60) and (61) can be rewritten as 



v s = r i S i + al 

sd sr sd ' 



di 



'sd 

dt 



^sd 



(62) 
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d<n 


<, 


= rjl + <tL -^ 


sq 


srsq S d 



(63) 



where s is the superscript related to the stator reference frame, r gr = r g + (l g — al s ) / r r , 
e s gd and e s arefcems of the dq machine phases given by 



l sd 



uo 6 s 

r^rq 


+ 





L 



(64) 



and 



sq 






a - o 



i 



(65) 



The current x voltage transfer function of the induction machine can be obtained from (62) 
and (63) as 



KM KM i/r„ 



VfM V*(8) sr s +l 



(66) 



where r s = al s / r sr , V*(s) = V^s) - E'Js) and V£(s) = V^s) - E^s) . The fcems 

E s sd (s) and E s (s) are considered unmodeled disturbances to be compensated by the 

control scheme. 

Analyzing the current x voltage transfer functions of a standard machine, we can observe 

that the time constant r g has parameters which vary with the dynamic behavior of 

machine. Moreover, this plant has also unmodeled disturbances. This justifies the use of this 
control plant for evaluating the performance of proposed control schemes. 



5. Control System 

Fig. 2 presents the block diagram of a standard vector control strategy, in which the 
proposed control schemes are employed for induction motor drive. Block RFO realizes the 

vector rotor field oriented control strategy. It generates the stator reference currents i 8 ^ and 

i 8 * , angular stator frequency a;* of stator reference currents from desired reference torque 

T* , and reference rotor flux <j>* , respectively. Blocks VS-ACS implement the proposed 
robust adaptive current control schemes that could be the VS-MRAC strategy or the VS- 
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APPC strategy. Both current controllers are implemented on the stator reference frame. 
Block dq s / 123 transforms the variables from dq s stationary reference frame into 123 

stator reference frame. 

Generically, the current-voltage transfer function given by equation (66) can be rewritten as 



isdq\ ) Tr' S /\ t/s/ \ 



^» y>) s + a s 



(67) 



in which 6=1/ al and a g = 1 / r . In this model, the fcems e s gd and e s are considered 



sq 



unmodeled disturbances to be compensated by current controllers. The parameters a g and 

b s are known with uncertainties that can be introduced by machine saturation, temperature 
changes or loading variation. 




Fig. 2. Block diagram of the proposed IM motor drive system. 

5.1 VS-MRAC Scheme 

Consider that the linear first order plant of induction machine current-voltage transfer 

function W { s sd given by (67) and a reference model characterized by transfer function 



MLM = K 



DJs) 



(68) 



which attends for the stability constraints that is the constant b g in (67) and b e should have 
positive sign, as mentioned before. The output error can be defined as 



6 = 2 — 2 

Osdq sdq mdq ' 



(69) 
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where i s md ( i s md and i s ) are the outputs of the reference model. The tracking of the model 

control signal ( i s sd = i s md or i s = i s ) is reached if the input of the control plant is defined 
as 

< dq = V sdq + %A (70) 

where 0* d ( 0* ) and 0^ d ( $L ) are the ideal controller parameters, that can be only 
determined if W* 8d (s) is known. According to section 2, they can be determined as 



%=«T 9 =^-r A ' (71) 



and 

°2d ~ % ~ 7" 



(72) 



Once H^J^ (5) is not known, the controllers parameters ld (t) and 2d (t) are updated by 
using switching laws as 

e id q = -% s 9<e s 0sdq y s isdq ) (73) 

where i = [1,2] and y s sd is the reference currents i* d or the output currents^, , and 

6 id > 0* d are upper bounds which are assumed to be known, and the signal-function 
sgn is defined as 

[l if x > 

**»(*) = |-i if x <o- (74) 

Introducing nominal values of controller parameters idq{nom) (ideally ^ (nom) = ^).Itis 
convenient to modify the control plant input given by (70) for the following 
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sdq 



U ldq 

L sdq 

V 2dq 
■s* 

sdq 



with e T = \o vldq e sldq e v2dq o s2dq J , Q T nom = [ 



I ®sldq(nom) ^s2dq{nom) I anci 



(75) 



^ 



Au 



ldq 



sdq 



*2A, = Av 2d g + K, 



(76) 



in which 



6 s2dq = -Qs2d q S 9n(4sd/s*d q ) + 



sldq(nom) 
s2dq(nom) 



(77) 



and 



vldq 

9 

J v2dq 



9 vldq S 9<^ 
Q V 2dq S 9< e i 



'Qisdq V ldq) 
0sdq V 2dq- 



(78) 



where -, , , 9 , , ,-, , and 6 , are the controller parameters, 6 -, , , x and 6 , , n 

slag ' szag ' i>la# i>zag * ' sldqynom) szdqynom) 

are the nominal parameters of the controller, and v ld and v 2d are the system plant input 

and output filtered signals, respectively. The constants 6 gld or s2d is chosen by 
considering that 



"sldq > \^sldq 

~Q >> \ft* 
U s2dq ^ \ U s2dq 



sldq(nom) 



0, 



(79) 



s2dq(nom) 



The input and output filters given by equation (76) are designed as proposed in (Narendra 
& Annaswamy, 1989). The filter parameter A is chosen such that N (s) is a factor of 
det(s/ — A) . Conventionally, these filters are used when the system plant is the second 
order or higher. However, it is used in the proposed controller to get two more parameters 
for minimizing the tracking error e^ , . 
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Fig. 3. Block diagram of proposed VS-MRAC current controller. 

The block diagram of the VS-MRAC control algorithm is presented in Fig. 3. The proposed 
control scheme is composed by VS for calculating the controller parameters and a MRAC for 
determining the system desired performance. The VS is implemented by the block Controller 

Calculation, in which Equations (77) and (78) together are employed for determining 6 sld , 
9 s2d , 6 vld and v2d . These parameters are used by Controller blocks for generating the 
control signals v s sd . To reduce the chattering at the output of controllers, input filters, 
represented by blocks V id (s) and V-(s) are employed. They use filter model represented 
by Eqs. (76). These filtered voltages feed the IM which generates phase currents i s sd which 
are also filtered by filter blocks V od (s) and V (s) and then, compared with the reference 
model output i s md for generating the output error e^ , . The reference models are 



implemented by two blocks which implements transfer functions (68). The output of these 
blocks is interconnected by coupling terms 



• u o I8 m and u o IS md' respectively. This 
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approach used to avoid the phase delay between the input ( I* d ) and output ( I s \ ) of the 
reference model. 

5.1.1 Design of the Controller 

To design the proposed VS-MRAC controller, initially is necessary to choose a suitable 

reference model M- sd (s) . Based on the parameters of the induction machine used in 
present study, given in Table 1, the reference model employed is 

M ° (s) = 55 ° , (80) 

lsdqK ' s + 550 V ' 

From this reference model, the nominal values can be determined by using equations (71) 
and (72) which results in lsd{nom) = lsq{nom) = 3.7 and 2sd{nom) = 2sq{nom) = 55. 

Considering the restrictions given by (79), the parameters 6 gld and 9 s2d , chosen for 

achieving a control signal with minimum amplitude are sld =0.37 and s2d = 5.5 . It 

is important to highlight that choice criteria determines how fast the system converges to 
their references. Moreover, it also determines the level of the chattering verified at the 
control system after its convergence. As mentioned before the use of input and output filters 
are not required for control plant of fist order. They are used here for smoothing the control 
signal. Their parameters was determined experimentally, which results in 

A = l/0 vld = 6 vld =2.0 and v2d = v2 =0.1. This solution is not unique and 

different adjust can be employed on these filters setup which addresses to different overall 
system performance. 

5.2 VS-APPC Scheme 

The first approach of VS-APPC in (Silva et al. f 2004) does not deal with unmodeled 
disturbances occurred at the system control loop like machine ferns. To overcome this, a 
modified VS-APPC is proposed here. 
Let us consider the first order IM current-voltage transfer function given by equation (67). 

The main objective is to estimate parameters a s and b s to generate the inputs v sd and v 
so that the machine phase currents i s sd and i s following their respective reference currents 
i s sd and i s d and, the closed loop poles are assigned to those of a Hurwitz polynomials 
A*(s) given by 

A*(s) = s 3 + a 2 s 2 + a{ s + a* , (81) 
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where coefficients a 2 , a£ and Qq determine the closed-loop performance requirements. 
To estimate the parameters a g and b g , the respective switching laws are used 

a s = -a s sgn(e s 0sdq i s sdq ) , (82) 

K = K*9<4s dq < dq ) > (83) 

with the restrictions a g > \a\ and b g > 6 I satisfied, as mentioned before. The pole 

placements and the tracking objectives of proposed VS-APPC are achieved, if the following 
control law is employed 

QJs)L(s)V: dq (s) = -P(s)(i: dq - I") (84) 

which addresses to the implementation of the controller transfer function 

Qm(*) L (s) 

The polynomial Q m (s) is choose to satisfy Q m (s)I s J(s) = Q m (s)I s s *(s) = . For the IM 

current-voltage control plant (see equation (67)) and considering that the VS-APPC control 
algorithms are implemented on the stator reference frame, which results in sinusoidal 

reference currents, a suitable choice for the controller polynomials are Q m (s) = s + a;* 

(internal model of sinusoidal reference signals i* d and i* ), L(s) = 1 and 

P(s) = p 2 s 2 + p x s + p , where u* is the angular frequency of reference currents. This 
choice results in a current controller with the following transfer functions 

oj» - cw = 4f!_t4£±JiL m 

where angular frequency a;* is generated by vector RFO control scheme and coefficients p 2 , 
p 1 and p are determined by solving the Diophantine equation for desired Hurwitz 
polynomial A* (see equation (81)) as follows 

at — a c 
p 2 = ^-^ (87) 
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Pl = x ° (88) 

K 

* *2 ^ 

a n — w n a„ 

p = , ( 89 ) 

To avoid zero division on the equation (87)-(89), the switching law (83) is modified by 

K =K s 9n{el dq v s sdq ) + b s(nom) (90) 

in which ^ s ( nom \ is the nominal values of b s and the stability restriction 
becomes b > \b —b ( J. 

5 | s s{nom) | 

The control signals v s sd and ?/ generated at the output of the proposed controller VS-APPC 
can be derived from equation (86) which results in the following state-space model 

%lsdq = X 2sdq + Pl £ sdq ( 91 ) 

*U = -" 2 o< sdq + (Po " "feKfe ( 92 ) 

^ = *U + P 2 < d? (93) 

where £ s sdq (t) = & — ^ is the current error that is calculated from the measured 

quantities issued by data acquisition plug-in board as described next. Therefore, to generate 
the output signal of the controllers it is necessary to solve the equations (91)-(93). 
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Fig. 4. Block diagram of proposed VS-APPC current controller. 

The block diagram of the VS-APPC control algorithm for the machine current control loop is 
presented in Fig. 4. The proposed adaptive control scheme is composed a SMC parameter 
estimator and a machine current control loop subsystems. The SMC composed by blocks 
system controller and plant model identifies the dynamic of the IM current-voltage model. 

The output of this system generates the estimative of machine phase currents i g s d and i g s . 

The control loop subsystem composed by system controller and IM regulates the machine 

phase currents i s sd and i s and compensate the disturbances e s sd and e s . The comparison 

between the estimative currents ( i gd and i^ ) and the machine phase currents ( i s sd and i s ) 

determines the estimation errors e^ sd and ei . These errors together with machine voltages 

V s i and?/ , and VS-APPC algorithm set points a" , fo and b , n are used for calculating 

sa sq' o r s ' s s[nom) o 

parameter estimative d g andb g/ from the use of equations (82) and (90). These estimates 
update the plant model of the IM and are used by the controller calculation for together 
with, the coefficients of the desired polynomial A* and angular frequency a;* , determine 

the parameters of the system controller p 2 , p 1 and p . The introduction of the IMP into 

the controller modeling avoids the use of stator to synchronous reference frame 
transformations. With this approach, the robustness for unmodeled disturbances is 
achieved. 



5.2.1 Design of the Controller 

To design the proposed VS-APPC controller is necessary to choose a suitable polynomial 

and to determine the controllers coefficients p 2 , p x , and p . A good choice criteria for 
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accomplishing the bound system conditions, is to define a polynomial which roots are 
closed to the control plant time constants. The characteristics of IM used in this work are 
listed in the Table 1. The current-voltage transfer functions for dq phases are given by 



U») 



10 



VL(s) s + 587 



(94) 



A possible choice for suitable polynomial A* (s) can be 



% (a) = (s + 587) 3 



(95) 



According to Equations (82), (90) and (87)-(89), and based on the desired polynomial (95), 
the estimative of the parameters of VS-APPC current controllers can be obtained as 



P-2 



Pi 



Po 



1761 - a. 



1033707 - <J 



202262003 - to z d s 



(96) 
(97) 
(98) 



To define the coefficients of the switching laws it is necessary to take into account together 
the stability restrictions a g > a I and b g > \l 



s{nom) 



. Based on the simulation and the 



theoretical studies, it can be observed that the magnitude of the respective switching laws 
(a and b ) determine how fast the VS-APPC controllers converge to their respective 

references. However, the choice of greater values, results in controllers outputs ( v sd and v ) 

with high amplitudes, which can address to the operation of system with nonlinear 
behavior. Thus, a good design criteria is to choose the parameters closed to average values 

of control plant coefficients a s and b s . Using this design criteria for the IM employed in this 

work, the following values are obtained & s ( nom \ = 9 , b s =2 and a s = 600 . This solution is 

not unique and different design adjusts can be tested for different induction machines. The 
performance of these controllers is evaluated by simulation and experimental results as 
presented next. 



r s = 31.0ft 


r = 27.2ft 

r 


l s = 0.8042# 


l r = 0.7992# 


l m = 0.7534tf 


J = 0.0133%.m 2 


F = 0.0U6kg.m 


P = 2 
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Table 1. IM nominal parameters 

6. Experimental Results 

The performance of the proposed VS-MRAC and VS-APPC adaptive controllers was 
evaluated by experimental results. To realize these tests, an experimental platform 
composed by a microcomputer equipped with a specific data acquisition card, a control 
board, IM and a three-phase power converter was used. The data of the IM used in this 
platform, are listed in Table 1. The command signals of three-phase power converter are 
generated by a microcomputer with a sampling time of 100 (is . The data acquisition card 

employs Hall effect sensors and A/D converters, connected to low-pass filters with cutoff 
frequency of f = 2.5 kHz . Figures 5(a) and 5(b) show the experimental results of VS- 
MRAC control scheme. In these figures are present the graphs of the reference model phase 
currents i s md and i s superimposed to the machine phase currents i s sd and i s . In this 

experiment, the reference model currents are settled initially in I s , = 0.8^4 and 

f = 30 Hz . At the instant t = 0.155 , each reference model phase currents is changed by 

I s md = 0.2^4 . In these results it can be observed that the machine phase currents follow the 
model reference currents with a good transient response and a current ripple 
ofAiV ~ 0.05^4. Figures 6-7 present the experimental results of VS-APPC control 

scheme. In the Fig. 6(a) are shown the graph of reference phase current i s sd superimposed 
by its estimation phase current i* d . In this test, similar to the experiment realized to the VS- 
MRAC, the magnitude of the reference current is settled in I s sd = 0.8^4 and at instant 

t = 0.15s, it is changed by/*? = 0.2^4 . These results show that the estimation scheme 
employed in the VS-APPC estimates the machine phase current with small current ripple. 
Figure 6(b) shows the graphs of the reference phase current i s sd superimposed by its 

corresponded machine phase current i s sd . In this result, it can be verified that the machine 

phase current converges to its reference current imposed by RFO vector control strategy. 
Similar to the results presented before, Fig. 7(a) presents the experimental results of 

reference phase current i s * superimposed by its estimation phase current i* and Fig. 7(b) 
shows the reference phase current i 8 * superimposed by its corresponded machine phase 

current i s . These results show that the VS-APPC also demonstrates a good performance. In 

comparison to the VS-MRAC, the machine phase currents of the VS-APPC present small 
current ripple. 
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1. Introduction 

Networked control systems (NCSs) are a type of distributed control systems, where the 
information of control system components (reference input, plant output, control input, etc.) 
is exchanged via communication networks. Due to the introduction of networks, NCSs have 
many attractive advantages, such as reduced system wiring, low weight and space, ease of 
system diagnosis and maintenance, and increased system agility, which motivated the 
research in NCSs. The study of NCSs has been an active research area in the past several 
years, see some recent survey articles (Chow & Tipsuwan, 2001; Hespanha & Naghshtabrizi, 
2007; Yang, 2006) and the references therein. On the other hand, the introduction of 
networks also presents some challenges such as the limited feedback information caused by 
packet transmission delays and packet loss; both of them are due to the sharing and 
competition of the transmission medium, and bring difficulties for analysis and design for 
NCSs. The information transmission delay arises from by the limited capacity of the 
communication network used in a control system, whereas the packet loss is caused by the 
unavoidable data losses or transmission errors. Both the information transmission delay and 
packet loss may result in randomly missing output measurements at the controller node, as 
shown in Fig. 1. So far different approaches have been used to characterize the limited 
feedback information. For example, the information transmission delay and packet losses 
have been modeled as Markov chains (Zhang et al., 2006). The binary Bernoulli distribution 
is used to model the packet losses in (Sinopoli et al., 2004; Wang et al., 2005 a & 2005 b). 
The main challenge of NCS design is the limited feedback information (information 
transmission delays and packet losses), which can degrade the performance of systems or 
even cause instability. Various methodologies have been proposed for modeling, stability 
analysis, and controller design for NCSs in the presence of limited feedback information. A 
novel feedback stabilization solution of multiple coupled control systems with limited 
communication is proposed by bringing together communication and control theoretical 
issues in (Hristu & Morgansen, 1999). Further the control and communication codesign 
methodology is applied in (Hristu- Varsakelis, 2006; Zhang & Hristu- Varsakelis, 2006) - a 
method of stabilizing linear NCSs with medium access constraints and transmission delays 
by designing a delay-compensated feedback controller and an accompanying medium 
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access policy is presented. In (Zhang et aL, 2001), the relationship of sampling time and 
maximum allowable transfer interval to keep the systems stable is analyzed by using a 
stability region plot; the stability analysis of NCSs is addressed by using a hybrid system 
stability analysis technique. In (Walsh et aL, 2002), a new NCS protocol, try-once-discard 
(TOD), which employs dynamic scheduling method, is proposed and the analytic proof of 
global exponential stability is provided based on Lyapunov's second method. In (Azimi- 
Sadjadi, 2003), the conditions under which NCSs subject to dropped packets are mean 
square stable are provided. Output feedback controller that can stabilize the plant in the 
presence of delay, sampling, and dropout effects in the measurement and actuation 
channels is developed in (Naghshtabrizi & Hespanha, 2005). In (Yu et aL, 2004), the authors 
model the NCSs with packet dropout and delays as ordinary linear systems with input 
delays and further design state feedback controllers using Lyapunov-Razumikhin function 
method for the continuous-time case, and Lyapunov-Krasovskii based method for the 
discrete-time case, respectively. In (Yue et aL, 2004), the time delays and packet dropout are 
simultaneously considered for state feedback controller design based on a delay-dependent 
approach; the maximum allowable value of the network-induced delays can be determined 
by solving a set of linear matrix inequalities (LMIs). Most recently, Gao, et aL, for the first 
time, incorporate simultaneously three types of communication limitation, e.g., 
measurement quantization, signal transmission delay, and data packet dropout into the 
NCS design for robust H w state estimation (Gao & Chen, 2007), and passivity based 
controller design (Gao et aL, 2007), respectively. Further, a new delay system approach that 
consists of multiple successive delay components in the state, is proposed and applied to 
network-based control in (Gao et aL, 2008). 

However, the results obtained for NCSs are still limited: Most of the aforementioned results 
assume that the plant is given and model parameters are available, while few papers 
address the analysis and synthesis problems for NCSs whose plant parameters are 
unknown. In fact, while controlling a real plant, the designer rarely knows its parameters 
accurately (Narendra & Annaswamy, 1989). To the best of our knowledge, adaptive control 
for systems with unknown parameters and randomly missing outputs in a network 
environment has not been fully investigated, which is the focus of this paper. 
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Fig. 1. An NCS with randomly missing outputs. 



It is worth noting that systems with regular missing outputs - a special case of those with 
randomly missing outputs - can also be viewed as multirate systems which have uniform 
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but various input/ output sampling rates (Chen & Francis, 1995). Such systems may have 
regular-output-missing feature. In (Ding & Chen, 2004a), Ding, et al. use an auxiliary model 
and a modified recursive least squares (RLS) algorithm to realize simultaneous parameter 
and output estimation of dual-rate systems. Further, a least squares based self-tuning 
control scheme is studied for dual-rate linear systems (Ding & Chen, 2004b) and nonlinear 
systems (Ding et al., 2006), respectively. However, network-induced limited feedback 
information unavoidably results in randomly missing output measurements. To generalize 
and extend the adaptive control approach for multirate systems (Ding & Chen, 2004b; Ding 
et al., 2006) to NCSs with randomly missing output measurements and unknown model 
parameters is another motivation of this work. 

In this paper, we first model the availability of output as a Bernoulli process. Then we 
design an output estimator to online estimate the missing output measurements, and further 
propose a novel Kalman filter based method for parameter estimation with randomly 
output missing. Based on the estimated output or the available output, and the estimated 
model parameters, an adaptive control is proposed to make the output track the desired 
signal. Convergence of the proposed output estimation and adaptive control algorithms is 
analyzed. 

The rest of this paper is organized as follows. The problem of adaptive control for NCSs 
with unknown model parameters and randomly missing outputs is formulated in Section 2. 
In Section 3, the proposed algorithms for output estimation, model parameter estimation, 
and adaptive control are presented. In Section 4, the convergence properties of the proposed 
algorithms are analyzed. Section 5 gives several illustrative examples to demonstrate the 
effectiveness of the proposed algorithms. Finally, concluding remarks are given in Section 6. 
Notations: The notations used throughout the paper are fairly standard/ E ' denotes the 
expectation. The superscript ' T ' stands for matrix transposition; A max/min (X) represents the 

Maximum/ minimum eigenvalue of X; |X|=det(X) is the determinant of a square matrix 

X ; \\x f = tr (XX T ) stands forthe trace of XX T . If 3 S e R + and k e Z + , | f(k) \< S g(k) 

for k > k , then f(k) = 0(g(k)) ; if f(k) / g(k) -> for k -> oo , then f(k) = o(g(k)) . 



2. Problem Formulation 

The problem of interest in this work is to design an adaptive control scheme for networked 
systems with unknown model parameters and randomly missing outputs. In Fig. 2, the 
output measurements y k could be unavailable at the controller node at some time instants 

because of the network-induced limited feedback information, e.g., transmission delay 
and/ or packet loss. The data transmission protocols like TCP guarantee the delivery of data 
packets in this way: When one or more packets are lost the transmitter retransmits the lost 
packets. However, since a retransmitted packet usually has a long delay that is not desirable 
for control systems, the retransmitted packets are outdated by the time they arrive at the 
controller (Azimi-Sadjadi, 2003; Hristu-Varsakelis & Levine, 2005). Therefore, in this paper, 
it is assumed that the output measurements that are delayed in transmission are regarded as 
missed ones. 

The availability of y k can be viewed as a random variable y k . y k is assumed to have Bernoulli 
distribution: 
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Hr k r s ) = 1 Er k Er s tor k*s, 



Prob(/ fc 



1 -ju k , else if y k =0, 



(1) 



where < ju k < 1 . 

Consider a single-input-single-output (SISO) process (Fig. 2): 

A z x k = B z u k , y k = * k + v k 



(2) 



where u k is the system input, y k the output and v k the disturbing white noise with variance 
r v . A z and B z are two backshift polynomials defined as 

A z = 1 + fl a z _1 + a" 2 + • • • + a n z~ n ° , 
B z = b + \z~ x + b 2 z~ 2 +--- + K b z""" . 

The polynomial orders n a and n h are assumed to be given. Eqn. (2) can be written 
equivalently as the following linear regression model: 

y k = ( pl k o+v k/ (3) 



where 

% k = \_~ X k -l ~ X Jc-2 X k -n a U Jc U k -l ' ' ' U Jc-n b J ' 

6 = \a 1 a 1 "- a na b b 1 • • • b nb ] . 

Vector <p ok represents system's excitation and response information necessary for parameter 
estimation, while vector 6 contains model parameters to be estimated. 
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Fig. 2. Output-error (OE) model structure. 



For a system with the output-error (OE) model placed in a networked environment subject 
to randomly missing outputs, the objectives of this paper are: 

1. Design an output estimator to online estimate the missing output measurements. 

2. Develop a recursive Kalman filter based identification algorithm to estimate unknown 
model parameters. 
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3. Propose an adaptive tracking controller to make the system output track a given 
desired signal. 

4. Analyze the convergence properties of the proposed algorithms. 

3. Parameter Estimation, Output Estimation, and Adaptive Control Design 

There are two main challenges of the adaptive control design for a networked system as 
depicted in Fig. 1: (1) randomly missing output measurements; (2) unknown system model 
parameters. Therefore, in this section, we first propose algorithms for missing output 
estimation and unknown model parameter estimation, and then design the adaptive control 
scheme. 

3.1 Parameter estimation and missing output estimation 

Consider the model in (3). It is shown by (Cao & Schwartz, 2003) and (Guo, 1990) that the 
corresponding Kalman filter can be conveniently used for parameter estimation. In 
combination with an auxiliary model, the Kalman filter based parameter estimation 
algorithm for an OE model is given by 

4 = 4-!+^(y*-^Ai)' (4) 

v _ gq-igq ( 5 ) 

r v + <P a ,k P a,k-l<Pa,k 

p =P P a,k-l<Pa,k<Plk P a,k-l ^ 

*■ a,k *■ a,k-l T rt ' 

r v+<Pa,k V a,k-r%k 

*a*=fiA> (7) 



<Pa.k = [~ X a,k-l 



T ( 8 ) 

X a,k-2'" X a,k-n a U k U k-1 " ' U k-n h ' 



where 6 k represents the estimated parameter vector at time instant k . 

It is worth to note that the above algorithm as shown in (4) -(8) cannot be directly applied to 
the parameter estimation of systems with randomly missing outputs in a network 
environment, as y k in (4) may not be available. This motivates us to develop a new 
algorithm that can simultaneously online estimate the unavailable missing output and 
estimate system parameters under the network environment. The proposed algorithm 
consists of two steps. 

Step 1: Output estimation 

Albertos, et al. propose a simple algorithm that uses the input-output model, replacing the 
unknown past values by estimates when necessary (Albertos et al., 2006). Inspired by this 
work, we design the following output estimator: 
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z*=r*y* + (i -/*)&/ ( 9 ) 

with 

In (9), / fc is a Bernoulli random variable used to characterize the availability of y k at time 
instant k at the controller node, as defined in (1). With the time-stamp technique, the 
controller node can detect the availability of the output measurements, and thus, the values 
of y k (either 1 or 0) are known. The knowledge of their corresponding probability ju k is not 

used in the designed estimator. The structure of the designed output estimator is intuitive 
and simple yet very effective, which will be seen soon from the simulation examples. 

Step 2: Model parameter estimation 

Replacing y k in the algorithm (4) -(8) by z k , defining a new cp k , and considering the random 

variable y k , we readily obtain the following algorithm: 

4=4-i+k*(2 1 - % t 4- 1 ), ( 10 ) 

K _ fi-ift (H) 

P _ P .. p k -iwlP k -i ( 12 ) 

r v +<P k P k -i<Pk 

x b ,k = <P T k&k' ( 13 ) 

9k = [~ x b,k-i ~ x b ,k-2 -••- x b,k-n a u k u k _ x ■ • • u k _ nh ] T . ( 14 ) 

Remark 3.1. Consider two extreme cases. If the availability sequence {y lf ~ -,y k } constantly assumes 
2, then no output measurement is lost, and the algorithm above will reduce to the algorithm (4)-(6). 
On the other hand, if the availability sequence y k constantly takes 0, then all output measurements 
are lost, and the parameter estimates just keep the initial values. 

3.2 Adaptive control design 

Consider the tracking problem. Let y rk be a desired output signal, and define the output 

tracking error 

Ck-=y k -yr,k- 
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If the control law u k is appropriately designed such that y rk = q>l k 6 , then the average 
tracking error z k approaches zero finally. Replacing by 6 k _ x and <p ok by cp k yields 



i=l i=0 

~ ~ a i,k-l X b,k-l a n a ,k-l X b,k-n a + ^0,k-l U k + 

Therefore, the control law can be designed as 

1 



■K b ,k-l U k-n b - 



(15) 



The proposed adaptive control scheme consists of the missing output estimator [Equation 
(9)], model parameter estimator [Equations (10-14)], and the adaptive control law [Equation 
(15)]. The overall control diagram is shown in Fig. 3. 

4. Convergence Analysis 

This section focuses on the analysis of some convergence properties. Some preliminaries are 
first summarized to facilitate the following convergence analysis of parameter estimation in 
(10)-(12) and of output estimation in (9). Inspired by the work in (Chen & Guo, 1991; Ding & 
Chen, 2004a; Ding et al., 2006), the convergence analysis is carried out under the stochastic 
framework. 
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4.1 Preliminaries 

To facilitate the convergence analysis, directly applying the matrix inversion formula (Horn 
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& Johnson, 1991) 

(A + BCD)- 1 = A- 1 - A-'BiC- 1 + DA-'By'DA- 1 , 

the proposed parameter estimation algorithm in Section 3.1 [(10)-(12] can be equivalently 
rewritten as: 

4=4-i+»-; 1 p i %(2 t -% T 4-i), W 

^=i&+r;Vrfw£. ( 17 ) 

Suppose that P k is initialized by p I , where p is a positive real value large enough, and 
define r k = tr(P fc _1 ) . The relation between r k and | P fc _1 | can be established in the following 

lemma. 

Lemma 4.1. The following relation holds: 

lnEIP^hO^nEr,). (18) 

Proof: Using the formulae 

tr(X) = £4(X) and|X|=nA,(X), 

where n is the dimension of X , we have 

E|P; 1 |<(Er t )". 

This completes the proof. 

The next lemma shows the convergence of two infinite series that will be useful later. 

Lemma 4.2. The following inequalities hold: 

1 (19) 

^MitfEitfPm)* InE | P; 1 I +n lnp a.s., 

A > E(^P iW ) (2°) 

> //.r ^— < oo a.s., 

(InE | Pr'\) c 

where c> 1 . 

Proof: The proof can be done along the similar way as Lemma 2 in (Ding & Chen, 2004b) 
and is omitted here. □ 

The following is the well-known martingale convergence theorem that lays the foundation 
for the convergence analysis of the proposed algorithms. 
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Theorem 4.1. (Goodwin & Sin, 1984) Let {X k } be a sequence of nonnegative random variables 
adapted to an increasing a -algebras {F k } . If 

E(X k+1 \F k )<(l+\)X k -a k +fi k , a.s., 

where a k > , j3 k > 0, EX < oo , 2^ £/ ^ °° an ^ X P 'i < °° a ^ mos ^ surely (a.s.), then X k 
converges a.s. to a finite random variable and 

N 

Km V a- < oo, a.s. 

4.2 Convergence analysis 

To carry out the convergence analysis of the proposed algorithms, it is essential to 
appropriately construct a martingale process satisfying the conditions of Theorem 4.1. Main 
results on the convergence properties of the proposed algorithm are summarized in the 
following Theorem. 
Theorem 4.2. For the system considered in (3), assume that 

(Al) {v k ,F k } is a martingale difference sequence satisfying 

E(^|F fc _ 1 ) = 0,a.s., (21) 

E(^ 2 |F fc _ 1 ) = r,<ao,a.s.; (22) 

1 1 
(A2) is strictly positive real; 

(A3) B z is stable; i.e., zeros o f B z are inside the closed unit disk. 

Suppose the desired output signal is bounded: | y rk |< oo . Applying the missing output estimator 

[Equation (9)], model parameter estimator [Equations (10-14)], and the adaptive control law 
[Equation (15)], then the output tracking error has the property of minimum variance, i.e., 



1 k 

( a ) l im rZ (yr.i -yi +v if = °> a - s -; 

P) ^li^{^i-yr,) 2 \^i-i} = r v <co, a.s. 



Proof: As pointed out in (Goodwin & Sin, 1984; Chen & Guo, 1991), from (A2) it follows that 



it«?<0(l) + of|tyfla.s. 



(23) 



1 70 Adaptive Control 

Here, c a is a positive constant. Define the following vectors: 

e*=z*-p T 4-i/ 

From (2), (3), (16) and (16), it follows that 

r Ik=r k (x h -x btk +v k ), (24) 

^a+cv^j- 1 ^ ( 25 ) 

^=-^+/a- ( 26 ) 

Also define the parameter estimation error vector and a Lyapunov-like function as 

From (9), (16) and (25), we obtain 

9 k =0 k _ x + r;%(p k e k = £_, + r;%_ imk . (27) 

With (17) and (27), V t can be further evaluated as 

V k = V k _, + r?y k {<ple k f + 2r;Y k k % ~ r^P^l - C<pl? k <P k YX- 
Let us define 

1 T ~ 

yk = -<Pk&k + ( 7 ik-v k ). 
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Then we have 

n = V k _ t - 2r? Yk u k y k + 2r; 1 r k tf0 k v k - r;V k P k <p k (i - r;V k P k <p k )e 2 k 

= K-i - 2r; x y k u k y k + 2r; 1 r k (p k k _ 1 v k + 2r; 2 <p T k P k <p k [(e k - y k v k fv k + y k v\\ 
-r;V t P t <P t a - r;V k P„<P t )T k + 2r; 2 <p T k P k <P k a ~ r;V k P k <P k )r k v k 

-r; 2 <PlP k <P k a-r;V k p k <P k y k 

^ V k -i - 2 Cr k u k y k + 2r;V^ T <? t -i^ + 2r; 2 <p k P k <p k [(e k - y k v k ) T v k + y k v\\ 
-r;V k P k <P k (l- - <q>lP k q> k y k + 2r; 2 <p T k P k <p k (l - r;V k P k <P k )T k v k . 



(28) 



Note that (p[0 k _ x , e k - y k v k , (p k P k (p k and r k are uncorrelated with v k and F k _ 1 -measurable. Thus 
taking the conditional expectation of both sides of (28) with respect to F fc _ 1 gives 



-r- 2 Mk E[<plP k <p k {l - r;V k P k <P k )]h 2 - 



Consider that 



Therefore, we have 



= B z u k -A z x b/k 



Vk = 



k\ 2 



(29) 



In (A2), it is assumed that 



v4 2 



is positive real, which indicates 



V^^Z/W/ic^a.s. 

f=i 

Adding S k to both sides of (29) yields 

Efa+SJF^) < V fc _ 1 + S fc _ 1+ 2r;V fc E(^ T P^) 

-^"VfcE [<PkP k <P k (1 - r; x (plP k (p k )] t 

Define a new sequence: 



(30) 



(31) 
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V k+S k , (32) 

W k = * - — ,ol. 

(inE|p;M) c 

Since InE | P fc _1 | is nondecreasing and <p k P k (p k = o(l) , there exists a k such that if k > k we 
have 



(inE^l) (InE IP," 1 If 



* W t -i 



(InE IP; 1 |f 
2r;V,E( % T P t% ) C 33 ) 



From (12) we have 



(inEIP^lf 

■;V t E(i-r;V t T p t ^)t 

(InE | P" 1 |) c 



E(l-r;V k P k <p k )>0. 



Also note that by Lemma 4.2 the summation of the third term in (33) from to go is finite. 
Therefore, Theorem 4.1 is applicable, and it gives 



»r;>,E(l-r;V* T P^)r/ 

> — <oo a.s. (34) 

« (inEIP-lf ( } 

Further, Lemma 4.1 indicates 

I 77V^ < °° a - s ' (35) 

As l-r^Ef^P^) is positive and nondecreasing, it holds that 1 = 1 - r^Eytpf P k (p k ) . 
Hence, 



- < oo a.s. 



Since lim^^ lnEr fc = oo , then from the Kronecker lemma [15] it follows that 
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limA fc =0, a.s., 



where 



With 



and (23), we obtain 



1 k 



Po i=l v 



IP' = f0[QnErj] 



0[E(r k )] 



-O 



-+««Z'". E2 ( z .) +n i»Z M . 2 



■ A '°(j§ y ' 



Po 1=1 

k 



(37) 



By (22) we have 



i|>? = o(i) + o(iJ>fj. 



(38) 



Substituting (37) into (38) gives 



Hyf=0(l), a.s., 



which implies together with (37) that 



1 fc _ 
lim— V ^ 2 =0, a.s., 



or equivalently 
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1 k 

l im rZ (Vr,i -y l +v l ) 2 = °r a.s. 



SSo fc^- ' f " = ( 39 ) 

Since 

E {(yr,*-y* + ^) 2 l F *-i} = E [(yr /fc -yJ 2 + 2 yr,A -^y^ + ^IF^j 

= E[(y r/fc -y fc ) 2 |F fc _ 1 ] + 0-2r o +r, 
= EKy^-yJ^Fj.J-r^a.s., 

and n z k = nVk > we have 

i fe i fc 

! im 7ZM E fe -y r , ; ) 2 1 F^} = lim-^//.E{(y f -y f/1 .) 2 1 F...J = r y/ a.s. 
This completes the proof. □ 

5. Illustrative Examples 

In this section, we give three examples to illustrate the adaptive control design scheme 

proposed in the previous sections. 

The OE model shown in Fig. 2 in the simulation is chosen as 

b n + b,z~ 1 + b 7 z~ 2 
Vk=^ — — — — u k+v k , 
1 + a x z + a 2 z 

which is assumed to be placed in a network environment (Fig. 1) with randomly missing 
output measurements and unknown model parameters. {v k } is a Gaussian white noise 

sequence with zero mean and variance r v = 0.05 2 . The parameter vector 6 = \a x a 2 b \ b 2 f is 

to be estimated. Here, true values of 6 are 

<9 = [-0.3 0.6 0.5 0.2 0.34] T . 

For simulation purposes, we assume that: (1) 6 is unknown and initialized by ones; (2) the 
output measurement {y k } is subject to randomly missing when transmitted to the controller 
node; (3) the availability of the output measurements ( y k ) at the controller node is 
characterized by the probability ju k ; (4) The desired output signal to be tracked is a square 
wave alternating between -1 and 1 with a period of 1000. Mathematically, it is given by 

y r ,<5oo, +i )=(-l)' +1 , '=0,1,2,-, ; = 1,2,...500. 

In the following simulation studies, we carry out experiments for three different scenarios 
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regarding the availability of the output measurements at the controller node and the 
parameter variation, and examine the control performance, respectively. According to the 
proposed adaptive control scheme shown in Fig. 3, we apply the algorithms of the missing 
output estimator, model parameter estimator, and the adaptive control law to the 
networked control system. 

Example 1: ju k = 0.85 . In the first example, 85% of all the measurements are available at the 
controller node after network transmission from the sensor to the controller. The output 
response is shown in Fig. 4, from which it is observed that the output tracking performance 
is satisfactory. In order to take a closer observation on the model parameter estimation and 
output estimation, we define the relative parameter estimation error as 



<V% 



\e k -e\ 
\\o\\ 



100%. 



It is shown in Fig. 5 (solid blue curve) that S % is becoming smaller with k increasing. 

Comparison between the estimated outputs and true outputs during the time range 
501 < t < 550 is illustrated in Fig. 6: The dashed lines are corresponding to the time instants 
when data missing occurs, and the small circles on the top of the dashed lines represent the 
estimated outputs at these time instants. From Fig. 6 it can be found that the missing output 
estimation also exhibits good performance. 


































Fig. 4. Example 1: Output response when ju k = 0.85 . 




Fig. 5. Comparison of relative Parameter estimation errors for Example 1 and Example 2: 
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Blue solid line for Example 1; red dotted line for Example 2. 

Example 2: ju k = 0.65 . In the second example, a worse case subject to more severe randomly 
missing outputs is examined: Only 65% of all the measurements are available at the 
controller node. The output response is shown in Fig. 7. Even though the available output 
measurements are more scarce than those in Example 1, it is still observed that the output is 
tracking the desired signal with satisfactory performance. The relative parameter estimation 
error, 5 ' % , is shown in Fig. 5 (dashed red curve). Clearly, it is decreasing when k is 

increasing. The estimated outputs and the true outputs are illustrated in Fig. 8, from which 
we can see good output estimation performance. 

For the comparison purpose, the relative parameter estimation errors of these two examples 
are shown in Figure 5. We can see that the parameter estimation performance when 
/j k = 0.85 is better than that when ju k = 0.65 .It is no doubt that the estimation performance 

largely depends on data completeness that is characterized by ju k . 




Vk 

o Zk 



Fig. 6. Example 1: Comparison between estimated and true outputs when ju k = 0.85 (The 
dashed line represents output missing). 
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Fig. 7. Example 2: Output response when ju k = 0.65 . 
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Example 3: Output tracking performance subject to parameter variation. In practice, the model 
parameters may vary during the course of operation due to the change of load, external 
disturbance, noise, and so on. Hence, it is also paramount to explore the robustness of the 
designed controller against he influence of parameter variation. In this example, we assume 
that at k = 2500 , model parameters are all increased by 50%. The output response is shown 
in Fig. 9. It can be seen that: At k = 2500 , the output response has a big overshoot because of 
the parameter variation; however, the adaptive control scheme quickly forces the system 
output to track the desired signal again. 

Observing Figs. 4, 7, and 9 in three examples, we notice that the tracking error and 
oscillation still exist. This is mainly due to (1) the missing output measurements, and, (2) the 
relatively high noise-signal ratio (around 25%). On the other hand, it is desirable to develop 
new control schemes to further improve the control performance for networked systems 
subject to limited feedback information, which is worth to do extensive research. 




Fig. 8. Example 2: Comparison between estimated and true outputs when ju k = 0.65 (The 
dashed line represents output missing). 



m^m 



^fW»H 



Parameter variation 
occurs here 



Fig. 9. Example 3: Output response subject to parameter variation: At time instant k = 2500, 
all parameters are increased by 50%. 
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6. Conclusion 

This paper has investigated the problem of adaptive control for systems with SISO OE 
models placed in a network environment subject to unknown model parameters and 
randomly missing output measurements. The missing output estimator, Kalman filter based 
model parameter estimator, and adaptive controller have been designed to achieve output 
tracking. Convergence performance of the proposed algorithms is analyzed under the 
stochastic framework. Simulation examples verify the proposed methods. It is worth 
mentioning that the proposed scheme is developed for SISO systems in this work, and the 
extension to multi-input-multi-output (MIMO) systems is a subject worth further 
researching 
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1. Introduction 

Neural network has good nonlinear function approximation ability. It can be widely used to 
identify the model of controlled plant. In this chapter, the theories of modeling uncertain 
plant by using two kinds of neural networks: feed-forward neural network and recurrent 
neural network are introduced. And two adaptive control strategies for robotic tracking 
control are developed. One is recurrent fuzzy neural network based adaptive control 
(RFNNBAC), and another is neural network based adaptive robust control (NNBARC). In 
RFNNBAC, a kind of recurrent fuzzy neural network (RFNN) is constructed by using 
recurrent neural network to realize fuzzy inference, In which, temporal relations are 
embedded in the network by adding feedback connections on the first layer of the network. 
Two RFNNs are used to identify and control plant respectively. Base on the Lyapunov 
stability approach, the convergence of the proposed RFNN is analyzed. In NNBARC, A 
robust controller and a neural network are combined into an adaptive robust robotic 
tracking control scheme. Neural network is used to approximate the modeling uncertainties 
in robotic system. Then the disadvantageous effects on tracking performance, due to the 
approximating error of the neural network and non-measurable external disturbances in 
robotic system, are attenuated to a prescribed level by robust controller. The robust 
controller and the adaptation law of neural network are designed based on Hamilton-Jacobi- 
Issacs (HJI) inequality theorem. The weights of NN are easily tuned on-line by a simple 
adaptation law, with no need of a tedious and lengthy off-line training phase. 
This chapter is organized in the following manner. In the first section a robust robotic 
tracking controller based on neural network is designed and its effectiveness is proved by 
applying it to control the trajectories of a two-link robot. Secondly, a recurrent fuzzy neural 
network based adaptive control is proposed and simulation experiments are made by 
applying it on robotic tracing control problem to confirm its effectiveness. Finally, some 
conclusions are drawn. 

2. A robust robotic tracking controller based on neural network 

In the past decades, there has been much research on the applications of nonlinear control 
theory to control robots, and many useful properties of robot dynamics such as the skew- 
symmetry property were discovered. There are basically two strategies to control such 
uncertain nonlinear systems: the robust control strategy and the adaptive control strategy. A 
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convenient point of robust control strategy is that it can attenuate disadvantageous effects of 
various uncertainties (e.g., structured parametric uncertainties and unstructured 
disturbances) to a required level, provided that the upper bound of uncertainties is well 
known (Abdallah et al. 1991). However, since this strategy use max-min method to design 
the controller, it can not yield good transient performance. On the other hand, regressor 
matrixes are always used in the design of adaptive control systems for robot manipulators 
(Ortega & Spong 1989). In this situation, the unknown nonlinear dynamics of robotic 
systems are always assumed to be linearly parametrisable. However, there are some 
potential difficulties associated with this classical adaptive control design. For example, the 
unknown parameters may be quickly varying, the linear parametrisable property may not 
hold, computation of the regressor matrix is a time-consuming task, and implementation 
also requires a precise knowledge of the structure of the entire robot dynamic model (Saad 
et al. 1994; Sanner & Slotine 1998; Spooner & Passino 1996). 

It has been shown that multi-layer neural networks can approximate any continuous 
function as accurately as possible. Based on this universal approximation property, many 
important adaptive neural-network-based control schemes have been developed to solve 
highly nonlinear control problem (Sanner & Slotine 1998; Spooner & Passino 1996; Narenra 
& Parthasarathy 1990; Polycarpou 1996). But most of these schemes use grads-descent 
method to train the weights, which can not ensure the stability of whole closed-loop system. 
In the recent years, researchers began to develop the neural-network-based controller with 
closed-loop stability based on the Lyapunov method. A controller based on forward 
propagation network was developed in (Carelli et al. 1995), but it didn't consider the effects 
of uncertainties. An adaptive neural network control strategy with guaranteed stability was 
proposed in (Behera et al. 1996) on the assumption that the approximation error of the 
neural network is known and bounded. 

In the first part of this chapter, we will propose a neural-network-based robust robotic 
tracking controller according to HJI inequation theorem presented by Shen in (Shen 1996). A 
neural network equipped with a robust learning algorithm is introduced firstly to learn the 
modeling uncertainties in robotic system. Then the disadvantageous effects on tracking 
performance caused by neural network approximating error and non-measurable external 
disturbances in robotic system will be attenuated to a prescribed level by the designing a 
robust controller. 

This section is organized as follows. In subsection 2.1, HJI inequation theorem is introduced. 
In subsection 2.2 the dynamics of robot system and its properties are described. The neural 
network based robust control strategy is proposed in subsection 2.3, where the structure of 
robust controller and the robust learning algorithm of neural network are derived. 
Simulations for a two-link robot are presented in subsection 2.4. 

2.1 HJI inequation theorem 

A system with non-measurable disturbance d can be formulated as: 

x = f(x) + g(x)d (1) 

For evaluating the disturbance restraint performance of system (1), an evaluation signal 
z = h(x) is introduced to represent the signals need to be concerned, such as error. And a 
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performance index signal can be defined as: 

J=sup frr (2) 

|d| 2 *0 Il d || 2 

Obviously, smaller J means better disturbance restraint performance. The robust design 
problem of system (1) can be solved by designing a controller to make J less than a 
prescribed level. 

HJI(Hamilton-Jacobi-Isaacs)InequationTheorem: Given an positive constant y > , if there 
exists an derivable function, V(x)>0, which satisfies the following HJI inequation: 

V = ^x = ^f(x) + ^g(x)d<il 2 ||d| 2 HN| 2 lvd (3) 

ax ax w ax 5W 2 * y w 

then the performance index signal of system (1) is less than y , that is to say, J < y . 

2.2 Problem statement 

The kinetics equation of a robotic manipulator with uncertainties can be expressed as: 

M(q)q + V(q, q)q + G(q) + AT(q, q) + d R = T (4) 

where q,q,qeR n is the joint position, velocity, and acceleration vectors; M(q)eR nxn 

denotes the moment of inertia; V(q,q)q are the Coriolis and centripetal forces; G(q) includes 

the gravitational forces; T is the applied torque; AT(q,q) represents the modelling 

uncertainties in robotic system, and d R is external non-measurable disturbance. 

It is well known that the robot dynamics has the following properties. 

Property 1— Boundedness of the Inertia matrix: The inertia matrix M(q) is symmetric and 

positive definite, and satisfies the following inequalities: 

0<X m I<M(q)<X M I (5) 

where A m and A M are known positive constants. 

Property 2 — Skew symmetry: The inertia and centripetal-Coriolis matrices have the 
following property: 

? T {M(q)-2V(q,q)}? = 0,V?€R n (6) 

Property 1 is very important in generating a positive definite function to prove the stability 
of the closed-loop system. Property 2 will help in simplifying the controller. 
The aim of this paper is to design a neural-network-based robust controller (NNBRC) for the 
robot system under uncertainties, such that closed-loop system is guaranteed to be stable 
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and the joint position q(t) can track the desired trajectory q^W rapidly and accurately. 



2.3 Design of NNBRC 

A NNBRC is proposed in this section. In the proposed strategy, a neural network (NN) is 
firstly used for identifing modelling uncertainties AT(q, q) , then, a robust learning algorithm 

and a robust controller are designed based on HJI equation theorem to counteract the 
disadvantageous effects caused by approximation error of the NN and external disturbance 
d R . 



2.3.1 Construction of the neural network 

A three-layer NN is shown in Fig.l.Using u- ' ,o\ ' to denote the input and output of the ith 

node in the 1th layer separately, the signal propagation and the operation functions of the 
nodes in each layer are introduced as follows. 




Fig. 1. Structure of three-layer NN 



Layer 1 - Input Layer: 



(!) (!) • 1 o 

o > ' - u- ' = Xj , i = 1,2, • • • , m 



(7) 



Layer 2 - Hidden Layer: 



uf> = !of),j = lA-,k 



o| 2) = 0j =l/[l + exp(-uj 2) )],j = 1,2, •••,]< 



(8) 
(9) 



Adaptive Control Based On Neural Network 



185 



Layer 3- Output Layer: 



,P)-«P). 



,(2) 



y h = o£' = u£' = ZWhj -of ,h = l,2,...,n 



(10) 
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then the outputs of the three-layer NN can be written as: 

Y = Wo- 



(11) 



In this paper, the three-layer NN described above will be used to identify the modeling 
uncertainties AT(q,q) in robotic system. Using s T to denote the network approximation 
error, then the modeling uncertainties can be denoted by: 



AT (q,q) = W T cr T +£ T AT(q,q) 



(12) 



where W T is the weight matrix, cr T is the activation function vector. 

Substitute (12) into (4), then the dynamics of the robot manipulator with a NN identifier can 



be formulated as: 



M(q)q + V(q, q)q + G(q) + W T cr T + s T + d R = T 



(13) 



Regarding s T as another external disturbance of robotic system, and using £ R = s T + d R , 
then (13) can be rewritten as: 

M(q)q + V(q,q)q + G(q) + W T cr T + £ R = T (14) 

For attenuating disadvantageous effects caused by £ R to a prescribed level, a robust 
learning algorithm of NN and a robust controller can be designed based on HJI equation as 
below 2.3.2. 



2.3.2 Robust controller and NN learning algorithm 

At first, we introduce a control signal u, which satisfies: 

M(q)q d +V(q,q)q d +G(q) + u = T 



(15) 



where qd'qd>qd G R n * s ^ es i rec ^ joint position, velocity, and acceleration vectors separately. 
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Thus, the closed-loop robot control system can be constructed by substituting (15) into (14). 
Let e = q -q d , the closed-loop system can be formulated as: 



M(q)e + V(q,q)e + W T cr T +s R =n 



(16) 



By regarding s R as external disturbance and introducing the evaluation signal z R = pe , 
where p is a positive constant, we can define the index signal as: 



] R = sup 



Z R 



\\£R\\ 2 * \\ £R h 



(17) 



The idea of NNBRC is to design controller u and the NN learning algorithm W T such that 
J R is less than a prescribed level, y . 
Define two state variables as: 



(18) 



|x 2 =e + ae 
where a is an prescribed positive constant. Thus, system (16) can be rewritten as: 



x-i =x? -ax-] 



I Mx 2 = -Vx 2 + co- W T cr T - s R +u 
where oo = Mae + Vae , W T is a n x k matrix that can be described as: 



W T = 



W TH W T12 ■■■ w Tlk 
W T21 W T22 •" w T2k 



= [v 



Tl W T2 ••* w TkJ 



_w Tnl w Tn2 ... w Tnk _ 
Theorem 1: Considering system (19), if the learning algorithm of NN is: 

W T = -7]Wj 

The controller u is designed as: 

U = -Xi -&> + W T CF T - So + — - 

I 2Y^ 



(19) 



(20) 



(21) 



and the parameter p in the evaluation signal , z R = pe = pxi , satisfies: 
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a-\v 2 =H (22) 

where s^ , £ 2 an d rj are all prescribed positive constant, then the disturbance restraint index 

signal of system (19), J R , is less than y . 

Proof: Considering system (19), we define the following derivable function: 

L = |-x a T - Xl +~x\ Mx 2 +|-|W T | 2 (23) 

Thus, 

L = x 1 x 1 +x 2 Mx 2 +-x 2 Mx 2 + Xw Ti w Ti 
2 i=i 

= x 1 (x 2 -ax 1 )+x 2 (&>-W T cr T -£ R +u) 
\ T m T 

+ — x 2 (M-2V)x 2 + Zw Ti w Ti 
2 i=i 

According to Property 2 of the robot dynamics, the above equation can be rewritten as: 

L = Xi (x 2 - ax-i ) + x 2 (co - Wjcrj - s R + u) + X w Ti w Ti 

i=l 

= ~X 1 OX 1 +X 2 (X 1 + CO-Wj(Jj - £ R + u) + Xw Ti w Ti 

i=l 

Substituting (20) into above equation, then 

L = -x 1 ax 1 + x 2 (x 1 + co- W T cr T - s R + u) - r/jWj | 
Regarding £ R as external disturbance, let 

tt t" 1 2 II i|2 1 || ||2 

H = L — r \\£t?\\ + — zr 

l~) ' II ^ II n II ^ II 

= -a|x 1 | +x 2 (xi + &>-W T cr T + u)-x 2 £ R 

IItat II 2 1 2 1| ||2 1 2 1| ||2 

-#T / k R +— P Xl 

II J- || r\ i ||1\|| O H H 

--\a — p |xi| +x 2 (x! + &>-W T cr T + u) 

T II I|2 1 2n I|2 

-X 2 fR-77||W T | "-/ \\£ R \\ 
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v-2x£s R -r 2 |^R|| 2 =-J2xJ^ R + r 2 |f R | 2 | 



1 II l|2 o T 2 II ||2 1 II ii2 

— - x 2 +2x 2 f R +X * R -— x 2 



-x 2 +rf R 



1 II l|2 
"TNI 



r 



-x 2 +/£r 

r 

1 II l|2 



l|2 . J_ 

2r 2 

I 1 9 in Il2 T / \ ll ll2 1 ii 
H<-\a p Xi + Xo Xi + G)-VJ T <J T +UJ-/7 W T + -Xn 

^ 2 J" " ' 'll in 2/ 2 

1 1 „ „2 

Xi + &> -W T <T T + U H r-X 2 - 77 W T 

2 r 2 



1 2 )|| ||2 T 

a ~2 P lll Xl H +X2 



Substituting (21), (22) into above inequation, then 



11 ll2 11 m2 11 m2 

H< -^ xj || -^2|| x 2|| -7 7||Wt|| <0 

t' ^ 1 I 2 1| ||2 11 ||2 

•• L ^2 V lr R l "1 Zr II 



According to HJI equation theorem, we can conclude that the disturbance restraint 
performance index signal of system (19), J R , is less than y . The structure of the proposed 
neural network based robust control strategy is illustrated in Fig. 2. 



*id 



Mq d + Vq d + G 



Id I + e 

— 4 — *0 — H 

— W^- — A 



%a 






I »■ Robot r 




<— > 



Fig. 2. Structure of the NN-based robust tracking control system 
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2.4 Simulation example 

In this section, the proposed control strategy will be applied to control the trajectory of a 
two-link robot (see Fig. 3) for proving its effectiveness. 




Fig. 3. Two-link robot 



In Fig.3, ml and m2 are masses of arml and arm2 respectively; 11 and 12 are lengths of arml 
and arm2; tl and t2 are torques on arml and arm2; di and 6 2 are positions of arml and 
arm2. The dynamics model of two-link robot is same as (4). 
Let 



q = [0i e 2 ] T ,q = [ei e 2 ] T 



(24) 



q = [fli 92] T ,T = [ tl t 2 ] T 
q = cosBi , Si ^sinSi, c^ =cos(0 i +6j) 



(25) 
(26) 



then M, V,G in (4) can de described as: 



M(q) = 



2 2 2 2 

m 1 l 1 +m 2 (li +1 2 +21il 2 c 2 ) m 2 l 2 +m 2 l 1 l 2 c 2 



m 2 l 2 +m 2 l 1 l 2 c 2 



m 2 l 2 



(27) 



V(q,q) = 



-2m 2 l 1 l 2 s 2 q 2 -m 1 l 1 l 2 s 2 q 2 
m 2 l 1 l 2 s 2 q 1 



(28) 



G(q) = 



m 2 l 2 gc 12 +(m 1 +m 2 )ligc 1 
m 2 l 2 gc 12 



(29) 
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In this paper, the parameters of the two-link robot are m 1 = 10 kg, m 2 = 2 kg, l^ =1.1 m, 
and 1 2 =0.8 m. The Initial states are q(0) = [0.5 0.5] rad, q(0) = [0 0] T rad/s, and 
q(0) = [0 0] rad/s 2 . The desired trajectories can be described as: 

q d (t) = [sin(2^t) cos(2^t)] T rad 



q d (t) = [2/zcos(2;rt) - 2^sin(2^t)] T rad/s 

q d (t) = [- 4^ 2 sin(2M) - 4^ 2 cos(2^t)f rad/s 2 
The model error due to friction is assumed as: 



AT: 



■0.5sign(e 1 )[0.1 + exp(-|e 1 |)]" 

sign(e 2 )[0.2 + exp(-|e 2 |)] 



N-m 



(30) 



(31) 



(32) 



(33) 



The external disturbance, d R =[d a d 2 ] T is a random signal which amplitude is less than 
ION m. In simulations, the NNBRC can be designed based on (21), in which a = 50 , 
8\ = 0.1 ,8*1 = 0.1 , y = 0.05 , p = 9 . The NN learning algorithm is designed according to (20), 
where rj = 0.1 . 

Fig.4 and Fig.5 present the simulation experiment results, in which, proposed control 
strategy is compared to traditional robust control (TRC) strategy. From these results, we can 
conclude that the NN-based robust tracking control strategy proposed in this paper can 
counteract disadvantageous effects caused by uncertainties in robotic system efficiently, and 
can achieve better transient performance than traditional robust control. 
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Fig. 5. Robot tracking errors 



3. A Recurrent Fuzzy Neural Network Based Adaptive Control 

Recently, much research has been done on using neural networks (NN) to identify and 
control dynamic systems (Park et al. 1996; Narendra & Parthasarathy 1990; Brdys & 
Kulawski 1999). NN can be classified as feed forward neural networks and recurrent neural 
networks. Feed forward neural networks can approximate a continuous function to an 
arbitrary degree of accuracy. However, feed forward neural network is a static mapping; it 
can not represent a dynamic mapping. Although this problem can be solved by using 
tapped delays, feed forward neural network requires a large number of neurons to represent 
dynamical responses in the time domain. Moreover, since the weight updates of feed 
forward neural network is irrelative to the internal information of neural network, the 
function approximation is sensitive to the training data. On the other hand, recurrent neural 
networks (Ku & Lee 1995; Ma & Ji 1998; Sundareshan & Condarcure 1998; Liang & Wang 
2000) are able to represent dynamic mapping very well and store the internal information 
for updating weights later. Recurrent neural network has an internal feedback loop; it 
captures the dynamical response of a system without external feedback through delays. 
Recurrent neural network is a dynamic mapping and demonstrates good performance in the 
presence of uncertainties, such as parameter variations, external disturbance, unmodeled 
and nonlinear dynamics. However, the drawbacks of recurrent neural network, which are 
same as neural network, are that the function of the network is difficult to interpret and few 
efficient constructive methods can be found for choosing network structure and determining 
the parameters of neurons. 

As is widely known, both fuzzy logic systems and neural network systems are aimed at 
exploiting human-like knowledge processing capability. In recent years, researchers started 
to recognize that fuzzy control has some similarities to neural network (Jang & Sun 1993; 
Hunt et al. 1996; Buckley et al. 1993; Reyneri 1999). Fuzzy neural network (FNN), which uses 
NN to realize fuzzy inference, combines the capability of fuzzy reasoning in handling 
uncertain information and the capability of neural networks in learning from processes. It is 
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possible to train NN using the experience of human operators expressed in term of linguistic 
rules, and interpret the knowledge that NN acquired from training data in linguistic form. 
And it is very easy to choose the structure of NN and determine the parameters of neurons 
from linguistic rules. However, a major drawback of the FNN is that its application domain 
is limited to static problems due to its feed forward network structure. 

Recurrent fuzzy neural network (RFNN) is a modified version of FNN, which use recurrent 
network for realizing fuzzy inference and can be constructed from a set of fuzzy rules. It 
inherits all characteristics of FNN such as fuzzy inference, universal approximation and 
convergence properties. Moreover, with its own internal feedback connections, RFNN can 
temporarily store dynamic information and cope with temporal problems efficiently. For 
this ability to temporarily store information, the structure of RFNN is much simpler than 
FNN. Fewer nodes are required in RFNN for system identification. 

In this section, a recurrent fuzzy neural network structure is proposed, in which, the 
temporal relations are embedded by adding feedback connections on the first layer of FNN. 
Back propagation algorithm is used to train the proposed RFNN. To guarantee the 
convergence of the RFNN, the Lyapunov stability approach is applied to select appropriate 
learning rates. For control problem, an adaptive control scheme is proposed, in which, two 
proposed RFNN are used to identify and control plant respectively. Finally, simulation 
experiments are made by applying proposed adaptive control scheme on robotic tracking 
control problem to confirm its effectiveness. 

This section is organized as follows. In subsection 3.2, RFNN is constructed. The 
construction of RFNNBAC is presented in subsection 3.3. Learning algorithms of RFNN are 
derived in subsection 3.4. Stability of RFNN is analyzed in subsection 3.5. In subsection 3.6 
proposed RFNNBAC is applied on robotic tracking control and simulation results are given. 
Finally, some conclusions are drawn in subsection 3.7. 

3.1 Construction of RFNN 

The structure of the proposed RFNN is shown in Fig. 6, which comprises n input variables, 
m term nodes for each input variable, / rule nodes, and p output nodes. This RFNN thus 
consists of four layers and n + (n x m) + I + p nodes. 

Using uj , Oj to denote the input and output of the ith node in the kth layer separately, the 

signal propagation and the operation functions of the nodes in each layer are introduced as 

follows. 

Layer 1 (Input Layer): This layer accepts input variables. Its nodes transmit input values to 

the next layer. Feedback connections are added in this layer to embed temporal relations in 

the network. For every node in this layer, the input and output are represented as: 

u 1 1 (k)=x 1 1 (k)+w 1 1 1 1 (k-l),0 1 1 (k) = u 1 1 (k),i = l,2,---,n (34) 

where k is the number of iterations; w { is the recurrent weights. 

Layer 2 (Membership Layer): Nodes in this layer represent the terms of respective linguistic 
variables. Each node performs a Gaussian membership function 
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(°i- a ij) 2 „2 



Oii 



,Oij = exp(u ;j 



(35) 



where i = 1,2, • • • , n , j = 1,2, • • • , m ; ay and b^ are the mean and the standard deviation of the 

Gaussian membership function; the subscript ij indicates the jth term of the ith input 
variable. 




Outyut 
Layer 



Rule 
Layer 



Membership 
Layer 



Input 
Layer 



Fig. 6. Structure of four-layer RFNN 



Layer 3(Rule Layer): This layer forms the fuzzy rule base and realizes the fuzzy inference. 
Each node is corresponding to a fuzzy rule. Links before each node represent the 
preconditions of the corresponding rule, and the node output represents the " firing 
strength" of corresponding rule. 
If the qth fuzzy rule can be described as: 
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qth rule: if x^ is A^, x 2 is A 2 q , ... , x n is A^ then y^ is B^ , y 2 is B 2 q , ..., y p is B^ , 
where A^ is the term of the ith input in the qth rule; B - q is the term of the jth output in the 

qth rule. 

Then, the qth node of layer 3 performs the AND operation in qth rule. It multiplies the input 

signals and output the product. 

Using O iq . to denote the membership of x { to A { q , where q { e{l,2,---,m}, then the input 

and output of qth node can be described as: 

Uq =nO? qi ,0 3 q = Uq \i = l,2,-,n;q = l,2,-,l (36) 

i 

Layer 4(Output Layer): Nodes in this layer performs the defuzzification operation, the input 
and output of sth node can be calculated by: 

4=Zw^, O s 4 =-HL. (37) 

q £O q 

q 

where s = l,2,---,p, q = l,2 / ---,l / w sq is the center of B- q , which represents the output 

action strength of the sth output associated with the qth rule. 

From the above description, it is clear that the proposed RFNN is a fuzzy logic system with 
memory elements in first layer. The RFNN features dynamic mapping with feedback and 
more tuning parameters than the FNN. In the above formulas, if the weights in the feedback 

unit Wj are all equal to zero, then the RFNN reduces to an FNN. Since a fuzzy system has 

clear physical meaning, it is very easy to choose the number of nodes in each layer of RFNN 

and determine the initial value of weights. Note that the parameters Wj of the feedback 

units are not set from human knowledge. According to the requirements of the system, they 
will be given proper values representing the memorized information. Usually the initial 
values of them are set to zero. 

3.2 Structure of RFNNBAC 

In this section, the structure of RFNNBAC will be developed below, in which, two proposed 
RFNN are used to identify and control plant respectively. 

3.2.1 Identification based on RFNN 

Resume that a system to be identified can be modeled by an equation of the following form: 



y(k)=f(y(k-l),...y(k-n y ), U (k),---,u(k-n u )) (38) 
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where u is the input of the system, n y is the delay of the output, and n u is the delay of the 

input. 

Feed forward neural network can be applied to identify above system by using y(k-l),... 

,y(k-n-l), u(k), ... , u(k-m) as inputs and approximating the function f . 

For RFNN, the overall representation of inputs x and the output y can be formulated as 



y(k) = g(ol(k),-,oi(k)) 



(39) 



Where 

oHk)= Xi (k)+wHk)oHk-l) 

= x i (k) + wHk)[x i (k-l) + wHk-l)oHk-2)] 

= x i (k) + wJ(k)x i (k-l)+wKk)wMk-l)xi(k-2)+--- 
+ wKk)wKk-l)-wHl)x i (0) 

Using the current input u(k) and the most recent output y(k-l) of the system as the inputs of 
RFNN, (39) can be modified as: 



y(k) = f(y(k-l),-,y(0),u(kV»,u(0)) 



(40) 



By training the RFNN according to the error e(k) between the actual system output and the 
RFNN output, the RFNN will estimate the output trajectories of the nonlinear system (38). 
The training model is shown in Fig. 7. 
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Fig. 7. Identification of dynamic system using RFNN 
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From above description, For Using RFNN to identify nonlinear system, only y(k-l) and u(k) 
need to be fed into the network .This simplifies the network structure, i. e., reduces the 
number of neurons 



3.2.2 RFNNBAC 

The block diagram of RFNNBAC is shown in Fig. 8. In this scheme, two RFNNs are used as 
controller (RFNNC) and identifier (RFNNI) separately. The plant is identified by RFNNI, 
which provides the information about the plant to RFNNC. The inputs of RFNNC are e(k) 
and e(k) . e(k) is the error between the desired output r(t) and the actual system output 

y(k). The output of RFNNC is the control signal u(k), which drives the plant such that e(k) is 
minimized. In the proposed system, both RFNNC and RFNNI have same structure. 
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Fig. 8. Control system based on RFNNs 



3.3 Learning Algorithm of RFNN 

For parameter learning, we will develop a recursive learning algorithm based on the back 
propagation method 

3.3.1 Learning algorithm for identifier 

For training the RFNNI in Fig.8, the cost function is defined as follows: 



I,(k)4i(e [s (k)) 2 =E(y s (k)-y Is (k)F 
2 S =1 



s=l 



(41) 



where y s (k) is the sth output of the plant, y Is (k) = O s is the sth output of RFNNI, and 

e Is (k) is the error between y s (k) and yi s (k) for each discrete time k. 

By using the back propagation (BP) algorithm, the weights of the RFNNI is adjusted such 
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that the cost function defined in (41) is minimized. The BP algorithm may be written briefly 
as: 



W x (k + 1) = W x (k) + AWj (k) 



= W I (k) + i 7 i 



dJi(k) 

awjfk) 



(42) 



where t/j represents the learning rate and Wj represents the tuning weights, in this case, 
which are w Isq , a Iiq . , b Iiqi , and w n . Subscript I represents RFNNI. 

According to the RFNNI structure (34)~(37), cost function (41) and BP algorithm (42), the 
update rules of RFNNI weights are 



Where 



wi 4 sq (k + l) = w 1 4 (k)-^ 



w4 



gJife) 

C^sqW 



(43) 



a Iiqi( k + l) = a Iiq i ( k )-'7i a 



gJiOO 

3a Hqi (k) 



(44) 



bii qi (k + l)=b Iiq .(k)-^ 



b SJi(k) 



<5b Hqi (k) 



(45) 



wj i (k + l) = wj i (k)-^ 



wl 



gJi(k) 
5wh(k) 



(46) 



**« =-e Is (k) °* 



5w fsq(k) 

aj r (k) , . wL -o 



s ^Io 3 



iq 



5a ii qi (k) 
5Ji(k) _ 



-Ie Is (k)- 



4 -O 4 

Is q Is n 3 
3 °iq" 



ZO 



iq 



2(oh-a Iiq .) 

(bliqj 2 



q 
. ■ v w 4 sq -Q 4 s 2(o/ i -a Iiq .) 2 

ab Iiqi(k )--? e - (k) -^^ q —°-.- ( bliqi )3 



ZOf q 

q 



5 Jl( k ) _ ^„ ^ W Isq-°^ ^3 - 2 (°Ii- a Iiqi) rt l 



5Wii(k) q s 



-ZZe Is (k)- 
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• o 
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3.3.2 Learning algorithm for controller 

For training RFNNC in Fig. 8, the cost function is defined as 

Jc(k)4Hs( k )) 2 =£(rs(k)-y s (k)) 2 (47) 

^ s=l s=l 

where r s (k) is the sth desired output, y s (k) is the sth actual system output and e s (k) is 
the error between r s (k) and y s (k) . 
Then, the gradient of J c is 

a Jc_£ 5 Jc dy s 



aw c 7dy s 5W c 

o(k)' 

au (k) 



Z _e s(k )^W.^W, (48 ) 

f sV ; au (k) aw c { > 



: £-e s (k)yu so (k)- 



aw r 



where u is the oth control signal, which is also the oth output of RFNNC, and 
yu so (k) = dy s (k)/du (k) denotes the system sensitivity. Thus the parameters of the RFNNC 
can be adjusted by 

W c (k + 1) = W c (k) + AW C (k) 

dj c (kh (49) 



:W c (k) + /7 C (- 



aw c (k) 7 



Note that the convergence of the RFNNC cannot be guaranteed until yu so (k)is known. 
Obviously, the RFNNI can provide yu so (k) to RFNNC. Resume that the oth control signal 
is also the oth input of RFNNI, then yu so (k) can be calculated by 

ay s (k) _ s aof s gQj'q ao^ ao/ Q 

3u (k) q80 f q 50? oqo 50[ 3u 

y*U-°i Q 3 -2(Q?o-a loqo ) (50) 

q Iof q Iq (b Ioqo ) 2 
q 

3.4 Stability analysis of the RFNN 

Choosing an appropriate learning rate 77 is very important for the stability of RFNN. If the 

value of the learning rate 77 is small, convergence of the RFNN can be guaranteed, however, 
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the convergence speed may be very slow. On the other hand, choosing a large value for the 
learning rate can fasten the convergence speed, but the system may become unstable. 

3.4.1 Stability analysis for identifier 

For choosing the appropriate learning rate for RFNNI, discrete Lyapunov function is 
defined as 

L I (k) = J I (k) = i2:(ei s (k)) 2 (51) 

^ s 
Thus the change of the Lyapunov function due to the training process is 

AL I (k) = L I (k + l)-L I (k) 

*• s ^ s 

= is[(e Is (k + l)) 2 -(e Is (k)) 2 ] 

^ S 

= ^l[(eis(k + l)+e Is (k))-(e Is (k + l)-e Is (k))] (52) 

^ S 

= il[(2e Is (k)+Ae Is (k)).Ae Is (k)] 

^ S 

= il[(Ae Is (k)) 2 +2e Is (k)Ae Is (k)] 

^ S 

= ix(Ae Is (k)) 2 +iz[2e Is (k)Ae Is (k)] 

^ S ^ S 

The error difference due to the learning can be represented by 

Ae Is (k) = e Is (k + l)-e Is (k)«|^||.AW I (k) (53) 



Where 



iy ' n dW 1 {k) ll sde Is (k) SW^k) 

s aw I (k) 



So (52) can be modified as 
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AL(k)=is 

^ S 



gglsOO ( „ dhfr) 
_ ^ 



aw T (k) ^ aw T (k) 



4? 



2e „ (k) .*l4kl.f_, 1 *W 



aw,(k) I, aw,(k) 



l(„ gJi(k) f^f ge Is (k) f aji(k) 
"2[' 7l aw I (k)J fUw,(k)J "^sw^'t 



eis(k) 



ge ts (k) 

aw,(k) 



If- gJi(K)f.fge Is (k)f 

n2T a, /, x\2 



5Ji(k) 

aWj(k) 



= 2 m 



l<5W,(k) 



7iZ 



ge Is (k) 

l<5W t (k) 



(54) 



To guarantee the convergence of RFNNI, the change of Lyapunov function ALj (k) should 
be negative. So learning rate must satisfy the following condition: 



0<7 I (k)<2/Z 



(Se ls (kj 



For the learning rate of each weight in RFNNI, the condition (22) can be modified as 



(55) 



0<^ w4 (k)<2/ max 
q 



( \ 1 

deisfr) 

dw* (k) 

Isq v ; 



(56) 



0</7f(k)< 2/ max 

q,i 
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3.4.2 Stability analysis for controller 

Similar to (51), the Lyapunov function for RFNNC can be defined as 
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L C (k) = J C (k)4^( e s(k)) 2 



(60) 



So, similar to (56)-(59), the learning rates for training RFNNC should be chosen according to 
the following rules: 
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3.5 Simulation Experiments 

Dynamics of robotic manipulators are highly nonlinear and may contain uncertain elements 
such as friction and load. Many efforts have been made in developing control schemes to 
achieve the precise tracking control of robot manipulators. Among available options, neural 
networks and fuzzy systems (Er & Chin 2000; Llama et al. 2000; Wang & Lin 2000; Huang & 
Lian 1997) are used more and more frequently in recent years. In the simulation experiments 
of this chapter, the proposed RFNNBAC is applied to control the trajectory of the two-link 
robotic manipulator described in chapter 2.4 to prove its effectiveness. 
In the simulation, the parameters of manipulator are m^ =4 kg, m 2 =2 kg, 1^ =1 m, 1 2 =0.5 

m, g =9.8 N/kg. Initial conditions are given as 0^0) =0 rad, 2 (0) =1 rad, (^(O) =0, 

and6 2 (0)=0 rad/s. The desired trajectory is given by ©i(t) = sin(2^t) and 2 (t) = cos(2^t) . 
The friction and disturbance terms in (4) are assumed to be 



5cos(5t) 
5cos(5t) 



Nm, AT(q, q) = 0.5sign(q) Nm. 
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Simulation results are shown in Fig.9 ~Fig.l4. Fig.9 and Fig.10 illustrate the trajectories of 

two joints; the two outputs of identifier (RFNNI) are shown in Fig.ll and Fig.12 separately; 

the cost function for RFNNC is shown in Fig.13; and Fig.14 shows the cost function for 

RFNNI. 

From simulation results, it is obvious that the proposed RFNN can identify and control the 

robot manipulator very well. 
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Fig. 9. Trajectory of jointl 
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Fig. 10. Trajectory of joint2 




0.5 1 

Time(Sec) 



1.5 



Fig. 11. Identifier (RFNNI) outputl 
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Fig. 12. Identifier (RFNNC) output2 
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Fig. 13. Cost function for RFNNC 
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Fig. 14. Cost function for RFNNI 
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4. Conclusion 

In this paper, the adaptive control based on neural network is studied. Firstly, a neural 
network based adaptive robust tracking control design is proposed for robotic systems 
under the existence of uncertainties. In this proposed control strategy, the NN is used to 
identify the modeling uncertainties, and then the disadvantageous effects caused by neural 
network approximating error and external disturbances in robotic system are counteracted 
by robust controller. Especially the proposed control strategy is designed based on HJI 
inequation theorem to overcome the approximation error of the neural network bounded 
issue. Simulation results show that proposed control strategy is effective and has better 
performance than traditional robust control strategy. Secondly, an RFNN for realizing fuzzy 
inference using the dynamic fuzzy rules is proposed. The proposed RFNN consists of four 
layers and the feedback connections are added in first layer. The proposed RFNN can be 
used for the identification and control of dynamic system. For identification, RFNN only 
needs the current inputs and most recent outputs of system as its inputs. For control, two 
RFNNs are used to constitute an adaptive control system, one is used as identifier (RFNNI) 
and another is used as controller (RFNNC). Also to prove the proposed RFNN and control 
strategy robust, it is used to control the robot manipulator and simulation results verified 
their effectiveness. 
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1. Introduction 

The control problem of the two-mass system originally derives from rolling-mill drives 
(Sugiura & Hori, 1996), (Ji & Sul, 1995), (Szabat & Orlowska-Kowalska, 2007). Large inertias 
of the motor, rolls and long shaft create an elastic system. The motor speed is different from 
the load side and the shaft undergoes large torsional torque. A similar problem exists in the 
field of conveyer drives (Hace et al., 2005). Also the performance of the machines used in 
textile industry is reduced by the non-ideal characteristics of the shaft (Beineke et al., 1997), 
(Wertz et al., 1999). An analogous problem appears in the paper machine sections 
(Valenzuela et al., 2005) and in modern servo-drives (Vukosovic & Stojic, 1998), (O'Sullivan 
et al., 2007), (Shen & Tsai, 2006). Moreover, torsional vibrations decrease the performance of 
the robot arms (Ferretti et al., 2004), (Huang & Chen, 2004). This problem is especially 
important in the field of space robot manipulators. Due to the cost of transport, the total 
weight of the machine must be drastically reduced. This reduces the stiffness of the 
mechanical connections which in turn influences the performance of the manipulator in a 
negative way (Katsura & Ohnishi, 2005), (Ferretti et al., 2005). The elasticity of the shaft 
worsens the performance of the position control of deep-space antenna drives (Gawronski et 
al, 1995). Vibrations affect the dynamic characteristics of computer hard disc drives (Ohno 
& Hara, 2006) and (Horwitz et al, 2007). 
Torsional vibrations can appear in a drive system due to the following reasons: 

- changeability of the reference speed; 

- changeability of the load torque; 

- fluctuation of the electromagnetic torque; 

- limitation of the electromagnetic torque; 

- mechanical misalignment between the electrical motor and load machine; 

- variations of load inertia 

- unbalance of the mechanical masses; 

- system nonlinearities, such as friction torque and backlash. 

The simplest method to eliminate the oscillation problem (occurring while the reference 
speed changes) is a slow change of the reference velocity. Nevertheless, it causes the 
decrease of the drive system dynamics and does not protect it against oscillations appearing 
when the disturbance torque changes. The conventional control structure based on the PI 
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speed controller, tuned by the classical symmetric criterion, with a single feedback from the 
motor speed is not effective in damping the speed oscillations. One of the simplest ways to 
improve the torsional vibrations ability of the classical structure is presented in (Zhang & 
Furusho, 2000). It is based on the suitable selection of the system closed-loop poles. However, 
this method improves the drive performance only for a limited range of the system 
parameters. 

When the resonant frequency of the system excides hundreds of Hertz, the application of the 
digital filters is an industrial standard. The Notch-filter is usually mentioned as a tool 
ensuring the damping of the oscillations (Vukosovic & Stojic, 1998), (Ellis & Lorenz, 2000). 
Rarely a low-pass filter or Bi-filter is used. The digital filters can damp the torsional vibration, 
yet the dynamics of the system may be affected. 

To improve performances of the classical control structure with the PI controller, the 
additional feedback loop from one selected state variable can be used. The additional 
feedback allows setting the desired value of the damping coefficient, but the free value of the 
resonant frequency cannot be achieved simultaneously (Szabat & Orlowska-Kowalska, 2007). 
According to the literature, the application of the additional feedback from the shaft torque is 
very common (Szabat & Orlowska-Kowalska, 2007). The design methodology of that system 
can be divided into two groups. In the first framework the shaft torque is treated as the 
disturbance. The simplest approach relies on feeding back the estimated shaft torque to the 
control structure, with the gain less than one. The more advanced methodology, called 
Resonance Ratio Control (RRC) is presented in (Hori et al., 1999). The system is said to have 
good damping ability when the ratio of the resonant to antiresonant frequency has a 
relatively big value (about 2). The second framework consists in the application of the modal 
theory. Parameters of the control structure are calculated by comparison of the characteristic 
equation of the whole system to the desired polynomial. To obtain a free design of the 
control structure parameters, i.e. the resonant frequency and the damping coefficient, the 
application of two feedbacks from different groups is necessary. The design methodology of 
this type of the systems is presented in (Szabat & Orlowska-Kowalska, 2007). 
The control structures presented so far are based on the classical cascade compensation 
schemes. Since the early 1960s a completely different approach to the analysis of the system 
dynamics has been developed - the state space methodology (Michels et al., 2006). The 
application of the state-space controller allows to place the system poles in an arbitrary 
position so theoretically it is possible to obtain any dynamic response of the system. The 
suitable location of the closed-loop system poles becomes one of the basic problems of the 
state space controller application. In (Ji & Sul, 1995) the selection of the system poles is 
realized through LQ approach. The authors emphasize the difficulty of the matrices selection 
in the case of the system parameter variation. The influence of the closed-loop location on the 
dynamic characteristics of the two-mass system is analyzed in (Qiao et al., 2002), (Suh et al., 
2001). In (Suh et al., 2001) it is stated that the location of the system poles in the real axes 
improve the performance of the drive system and makes it more robust against the parameter 
changing. 

In the case of the system with changeable parameters more advanced control concepts have 
been developed. In (Gu et al., 2005), (Itoh et al., 2004) the applications of the robust control 
theory based on the H^ and /u-synthesis frameworks are presented. The implementation of 
the genetic algorithm to setting of the control structure parameter is shown in (Itoh et al., 
2004). The author reports good performance of the system despite the variation of the inertia 
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of the load machine. The next approach consists in the application of the sliding-mode 
controller. For example, in paper (Erbatur et al., 1999) this method is applied to controlling 
the SCARA robot. A design of the control structure is based on the Lyapunov function. The 
similar approach is used in (Hace et al., 2005) where the conveyer drive is modelled as the 
two-mass system. The authors clam that the design structure is robust to the parameter 
changes of the drive and external disturbances. Other application examples of the sliding- 
mode control can be found in (Erenturk, 2008). The next two frameworks of control 
approach relies on the use of the adaptive control structure. In the first framework the 
controller parameters are adjusted on-line on the basis of the actual measurements. For 
instance in (Wang & Frayman, 2004) a dynamically generated fuzzy-neural network is used to 
damp torsional vibrations of the rolling-mill drive. In (Orlowska-Kowalska & Szabat, 2008b) 
two neuro-fuzzy structures working in the MRAS structure are compare. The experimental 
results show the robustness of the proposed concept against plant parameter variations. In 
the other framework changeable parameters of the plant are identified and then the 
controller is retuned in accordance with the currently identified parameters. The Kalman 
filter is applied in order to identify the changeable value of the inertia of the load machine 
(Orlowska-Kowalska & Szabat, 2008a). This value is used to correct the parameters of the PI 
controller and two additional feedbacks. A similar approach is presented in (Hirovonen et 
al., 2006). In the paper (Cychowski et al., 2008) the model predictive controller is applied o 
ensure the optimal control of the system states taking the system constrains into 
consideration. In order to reduce the computational complexity the explicit version of the 
controller is suggested to real-time implementation. 

This paper is divided into seven sections. After an introduction, the mathematical model of 
the two-mass drive system and utilised control structure are described. In section IV, the 
mathematical model of the NEKF is presented. The simulation results of the non-adaptive 
and adaptive NEKF are demonstrated in sections V. The proposed adaptation mechanism is 
described and the analysed algorithms are compared. After a short description of the 
laboratory set-up, the experimental results are presented in section VI. Conclusions are 
presented at the end of the paper. 

2. The mathematical model of the two-mass system and the control structure 

In technical papers there exist many mathematical models, which can be used for the 
analysis of the plant with elastic couplings. In many cases the drive system can be modelled 
as a two-mass system, where the first mass represents the moment of inertia of the drive and 
the second mass refers to the moment of inertia of the load side. The mechanical coupling is 
treated as an inertia free. The internal damping of the shaft is sometimes also taken into 
consideration. Such a system is described by the following state equation (Szabat & 
Orlowska-Kowalska, 2007) (with non-linear friction neglected): 
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where: Q\- motor speed, Qi- load speed, M e - motor torque, M s - shaft (torsional) torque, Ml- 
load torque, Ji - inertia of the motor, J2- inertia of the load machine, K c - stiffness coefficient, 
D - internal damping of the shaft. 

The described model is valid for the system in which the moment of inertia of the shaft is 
much smaller than the moment of the inertia of the motor and the load side. In other cases a 
more extended model should be used, such as the Rayleigh model of the elastic coupling or 
even a model with distributed parameters. The suitable choice of the mathematical model is 
a compromise between the accuracy and calculation complexity. As can be concluded from 
the literature, nearly in all cases the simplest shaft-inertia-free model has been used. 
To simplify the comparison of the dynamical performances of the drive systems of different 
power, the mathematical model (1) is expressed in per unit system, using the following 
notation of new state variables: 
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m v = m r 
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(2) 



where: On - nominal speed of the motor, Mn - nominal torque of the motor, coi, coz- motor 
and load speeds, m e , m s , mi - electromagnetic, shaft and load torques in per unit system. 
The mechanical time constant of the motor - Ti and the load machine - T2 are thus given as: 
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The stiffness time constant - T c and internal damping of the shaft - d can be calculated as 
follows: 
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Taking into account the equations (3) -(5) the state equation of the two-mass system in per- 
unit value is represented as: 
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Usually, due to its small value the internal damping of the shaft d is neglected in the 
analysis of the two-mass drive system. 



3. Adaptive control structure 

A typical electrical drive system is composed of a power converter-fed motor coupled to a 
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mechanical system, a microprocessor-based controllers, current, rotor speed and/ or position 
sensors used as feedback signals. Typically, cascade speed control structure containing two 
major control loops is used, as presented in Fig 1. 




tciicrrk: 



Torque 
controller 



Power 
converter 



Electromagnetic 
part of the motor 



Mechanical part 
of the drive 



Fig. 1. The classical cascade control structure of the two-mass system 

The inner control loop performs a motor torque regulation and consists of the power 
converter, electromagnetic part of the motor, current sensor and respective current or torque 
controller. As this control loop is designed to provide sufficiently fast torque control, it can be 
approximated by an equivalent first order term with small time constant. If the control is 
ensured, the driven machine could be an AC or DC motor, with no difference in the outer 
speed control loop. The outer loop consists of the mechanical part of the motor, speed sensor, 
speed controller, and is cascaded to the inner loop. It provides speed control according to the 
reference value (Szabat & Orlowska-Kowalska, 2007). 

Such a classical structure in not effective enough in the case of the two-mass system. To 
improve the dynamical characteristics of the drive, the modification of the cascade structure 
is necessary. In this paper the structure with the state controller which allows the free 
location of the closed-loop poles is considered. So it requires the additional information of 
the shaft torque and the load speed. The parameters of the control structures are set using 
pole-placement methods, with the methodology presented in (Szabat & Orlowska-Kowalska, 
2007), according to the following equations: 
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(9) 



where: §- - required damping coefficient, coq- required resonant frequency of the system. 

In the industrial applications, the direct measurement of the shaft torque m s and the load 
speed coz is very difficult. For that reason, in this paper the Nonlinear Extended Kalman 
Filter (NEKF) is used to provide the information about non-measurable mechanical state 
variables. Additionally, the time constant Ti of the load side is also estimated and used to 
on-line retuning the control structure parameters, according to Eq. (6)-(9). The estimated 
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value of Tie is also used to change the element qss of the covariance matrix Q in the way 
presented in the next section (Eq. (21)). The considered control structure is presented in Fig. 
2. The proposed adaptive control structure ensure the desired characteristic of the drives 
despite the changes of the time constant of the load machine. 



Twc^Mass System 
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Fig. 2. The block diagram of the state-feedbacks adaptive control structure 

3. Mathematical model of the nonlinear extended Kalman filter (NEKF) 

In the presence of the time-varying load machine inertia T2, there is a need to extend the 
two-mass system state vector (1) with the additional element I/T2 and non-measurable load 
torque mi : 



*r(0 = 
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(10) 



The extended, nonlinear state and output equations can be written in the following form: 

(lla) 
(lib) 



dt 
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y R (t) = C R x R (t)+v(t) 
where matrices of the system are defined as follows (in [p.u.]): 
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and w(t), v(t) - represent process and measurement errors (Gaussian white noise), according 

to the Kalman Filter (KF) theory. 

The matrix Ar depends on the changeable parameter T2. It means that in every calculation 

step this matrix must be updated due to the estimated value of T2. The input and the output 

vectors of the drive system (and NEKF) are electromagnetic torque and motor speed 

respectively: 



u = m e y = co l (13) 

After the discretization of Eq. (11) with T p sampling step, the state estimation using NEKF 
algorithm is calculated: 



x R (^ + l/yc + l) = x R (yc + l/^)+K(yc + l)[y R (^ + l)-C R (^ + l)x R (yc + l/^)] 



(14) 



where the gain matrix K is obtained by the suitable numerical procedure. 
In the first step the estimation of the filter co variance matrix is calculated: 

P(t + 1 / k) = F R (k)p(k)eZ (k) + Q{k) 



(15) 



where: 
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(16) 



and Q is a state noise co variance matrix. Fr is the state matrix of the nonlinear dynamical 
system (11) after its linearization in the actual operating point, which must be updated in 
every calculation step: 
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The filter gain matrix K of the NEKF and the update of the covariance matrix of the state 
estimation error P are calculated using the following equations: 



K(k + 1)= V(k + l/k)cl(k + l)[c R (£ + l)p(fc + l/k)cl(k + 1)+ R(^)]" 1 
P(t + l/* + l) = [l-K(t + l)C R (t + l)]p(t + l/t) 



(18) 
(19) 



where: R - the output noise covariance matrix. 

The quality of the state estimation depends on the suitable choice of the covariance matrices 
Q and R. However, according to the technical literature, the analytical guidelines which 
ensure proper setting of these matrices do not exist. Usually the trial and error procedure is 
used. However, this process is time-consuming and does not ensure the optimal 
performances of NEKF. In this paper elements of covariance matrices have been set using 
the genetic algorithm (Szabat & Orlowska-Kowalska, 2008), with the following cost function: 



F = \ ZK ~ m se\ Zh -®2e\ ZK ~ m Le\ Zl^ ~ T 2 



V i 



(20) 



where: m S/ co 2 , mi, Ti -real variables and parameter of the two-mass system; m se , co 2e , mu, ^ie 
-estimated variables and parameter, j - total number of samples. The cost function defined 
in this way ensures the optimal setting of covariance matrices Q and R for changeable time 
constant of the load machine. 

4. Simulation results 

4.1 Open-loop system 

In simulation tests the estimation quality of all system state variables is investigated. The 
shaft torque and the load speed are taken for the closed-loop structure with the direct 
feedback from system state variables (Fig.l). The electromagnetic torque and the motor 
speed, used as the input and output vectors of NEKF, are disturbed with white noises. In 
Fig. 3. the transients of the electromagnetic torque and motor speed are presented. 





Fig. 3. Transients of the electromagnetic torque (a) and the motor speed (b) 
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The drive system works in the reverse condition with the electromagnetic torque limit set to 
3 [p.u.] in the considered case is tested. The state estimator working outside the control 
structure is tested. The transients of all the real and estimated variables and theirs 
estimation errors are demonstrated In Fig 4. 

The NEKF starts work with a misidentified value of the time constant of the load machine 
(initial value of the Ti is set tol01.5ms - Fig 4.g). Then at the time h-7.s the time constant of 
the load machine Ti and the load torque mi begin to change (Fig. 4c, g). Those two variables 
vary in a smooth sinusoidal way. The NEKF estimates all the system states simultaneously. 
As can be seen from Fig. 4, the transients of all estimates contain high-frequency noises. The 
steady state level of the estimation error is about 0.02 (Fig. 4e) for the load speed and about 
0.10 (Fig. 4e) for the shaft torque. The biggest errors exist in the transients of the load torque 
and of time constant of the load machine (Fig. 4h). The initial estimation error of 1i, cause by 
the misidentified value of the time constant of the load machine is eliminated after 500ms. 
The typical disruptions can be seen in the estimated transient. They appear when the 
direction of the motor speed is rapidly changed. The characteristic feature of the NEKF is 
the fact that the estimation of the time constant of the load machine is only possible when 
the load speed is changing. Therefore, the biggest estimation errors occurs when the time 
constant of the load side is varied and the load speed is constant (Fig. 4g,h). The next NEKF 
feature is that the estimate of the Ti contains bigger frequency noises in the case when the 
real value of the T2 is larger. Because the load torque and time constant of the load machine 
have been varied in a smooth way good estimation accuracy has been achieved in the 
simultaneous estimation of all the states. 




h) 





Fig. 4. Transients of the real and estimated state variables and their estimation errors: load 
speed (a,d) shaft torque (b,e), load torque (c,f) and time constant of the load side (g,h) 
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Then the case of the rapid changing of the load torque and time constant of the load 
machine is considered. The input (electromagnetic torque) and output vector (motor speed) 
of the NEKF are presented In Fig. 5. As the previously the drive is working under reverse 
condition and the limit level of the electromagnetic torque is also set to 3 [p.u.]. The 
electromagnetic torque and the motor speed are disrupted by white noises, which emulate 
the measurements noises. The real and estimated variables and their estimation errors for 
rapid changes of the load torque and the load side inertia are presented in Fig. 6. 
Similarly as in the previous case, the drive system starts working with a misidentified time 
constant of the load machine T2=101.5ms (Fig. 6g). Then at the time t=ls and 3s the time 
constant of the load machine and the load torque change rapidly (Fig. 6c, g). Next, at the time 
t=5, 6 and 8s only the load torque and at the time t= 4, 6.5 and 8.5s only the time constant of 
the load machine vary quickly. The following work cycle allows to examine the quality of 
the variables estimation under different conditions. The average level of the estimation error 
is about 0.014 (Fig. 6e) for the load speed and about 0.06 for the shaft torque (Fig. 6f). 
However, the simultaneous alternation of the load torque and time constant of the load 
machine bring about the rise of the big, quickly damped estimation errors of the load speed 
(Fig. 6b) and shaft torque (Fig. 6d). A single change of the above-mentioned variables cause 
the increase of the estimation errors, but for a smaller extent than in the pervious case. The 
last two estimated variables, i.e. the load torque and the time constant of the load machine 
depend on each other significantly. The rapid change of one variable brings about a 
significant increase of the estimation error of the other variable (Fig. 6f,h). 





t[s] 



Us] 



a) b) 

Fig. 5. Transients of the electromagnetic torque (a) and the motor speed (b) 



Similarly as in the previous case, the drive system starts working with a misidentified time 

From the transients presented in Fig. 4 and Fig. 6 the following remarks can be formulated: 

-the estimation of the time constant of the load machine is possible only when the motor 

speed is changing; 

-the estimates of the load torque and the time constant of the load machine are correlated: 

the change of the load torque causes the rise of the error of the load machine time constant 

and vice versa. This is especially clearly visible in the transient presented in Fig. 6; 

-the noise level of the of the estimated load machine time constant of the strictly depends on 

the actual value of the real time constant and the value of the covariance matrix element (\ss) 

when of the value of the Ti is smaller, the element qss should have a bigger value and vice 



Adaptive control of the electrical drives with the elastic coupling using Kalman filter 



215 



The dynamic characteristics of the non-adaptive NEKF strictly depends on the proper 
setting of the covariance matrix values. In the case of the changeable time constant of the 
load machine the element qss is a compromise between the slow covariance for a small value 
of Ti and a large noise level when value of Ti is big. The modification of the estimating 
procedure is related to this feature. Because the noise level in the estimated variable 
depends on the real value of the Ti, the NEKF with the changeable element qss of the 
correlation matrix Q is proposed. The element qss adopts to the estimating value of the time 
constant of the load machine according to the following formula: 



T 



(21) 



where: qssN- the value of qss selected for the nominal parameters of the drive (using genetic 
algorithm), T2N - nominal time constant of the load machine, T2 e - estimated time constant 
of the load machine, n - power factor. 



a) 



b) 



c) 







d) 




Fig. 6. Transients of the real and estimated state variables and their estimation errors: load 
speed (a,d) shaft torque (b,e), load torque (c,f) and time constant of the load side (g,h) 
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Then the adaptive NEKF is tested under the same conditions as previously but with the 
adaptation formula (21). Because the biggest difference is visible in the time constant of the 
load machine only the transients of those variables are presented below. In Fig 7 the 
transients for smooth (case 1- a) and rapid (case 2- b) changes of the load torque and time 
constant of the load machine for power factor n=3 are presented. 

The difference between the non-adaptive and adaptive NEKF algorithm is clearly visible 
when the Fig. 4, 6 and 7 are compared. The estimate of Ti has a smaller estimation error and 
noise level than for the non-adaptive NEKF. The rapid changing of the load torque does not 
influence the estimate of T2 so significantly as in the previous non-adaptive NEKF case. 
Also the estimate of the load torque has better accuracy in the adaptive NEKF case. 
Similarly, the fast variation of the time constant of the load machine causes a smaller error in 
the estimate of load torque in the adaptive NEKF. 
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Fig. 7. Transients of the real and estimated time constant of the load side for the adaptive 
NEKF with power factor n=3, case-1 (a), case-2 (b) 
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In order to compare the performance of the non-adaptive and adaptive NEKFs, the 
estimation errors of all estimated have been calculated using of the following equation: 



H\ V ~ V e 



A =- 



(22) 



N 



where: N - total number of samples, v - real variable, 
v e - estimating variable. 



The estimation errors of all state variables for non-adaptive (n=0) and adaptive NEKF (n =3) are 
presented in the Table 1. 
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AC02 


Am s 


AT 2 


Am L 


Case 1 
n=0 


0.0092 


0.0456 


0.0180 


0.0942 


Case 1 
n=3 


0.0086 


0.0442 


0.0159 


0.0907 


Case 2 
n=0 


0.0140 


0.0605 


0.0301 


0.1073 


Case 2 
n=3 


0.0123 


0.0570 


0.0224 


0.0975 



Table 1. The estimation errors of the state variables for the case 1 and case 2 for the adaptive 
and non-adaptive NEKF 

The application of the adaptation mechanism decreases the estimation error in all estimated 
variables. This feature is especially evident when the time constant of the load machine and 
the load torque change rapidly (case -2). For instance, the application of the adaptation 
mechanism ensures the reduction of estimation error of the T 2e by approximately 25%. 

3.2 Closed-loop system 

First, the effectiveness of the proposed control structure has been investigated in the 
simulation study. The non-measurable state variables, e.g. shaft torque, load speed and load 
torque, are delivered to the control structure by the NEKF. 
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Fig. 8. Transients of the electromagnetic torque (a), motor speed (d), real and estimated state 
variables and their estimation errors: load speed (b,e), shaft torque (c,f), load torque (g,j), 
time constant of the load machine (h,k) adaptive control structure parameters (i,l) in the 
control structure with simultaneous estimation of the time constant of the load machine and 
load torque 

The estimated time constant of the load machine is used in the adaptation law in order to 
retune the control structure coefficients in accordance with (6)-(9). The adaptation formula 
(21) is used to improve the NEKF performance. However, in order to ensure the stable work 
of the control structure the coefficients of the covariance matrices are decreased in 
comparison to the previous section. The desired values of the resonant frequency of the 
system and the damping coefficient are <^=45s 1 and ^ r =0.7 respectively. The transients of 
the system states as well as the control structure coefficient are presented In Fig 8. 
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Fig. 9. Transients of the electromagnetic torque (a), motor speed (d), real and estimated state 
variables and their estimation errors: load speed (b,e), shaft torque (c,f), load torque (g,j), 
time constant of the load machine (h,k) adaptive control structure parameters (i,l) in the 
control structure with modified estimation algorithm 



The system starts work with a misidentified value of the time constant of the load machine 
Tie =101ms (Fig. 8h) which results oscillations in the estimated load torque transient. Despite 
this no visible oscillations appear in the transients of the load speed. After 2s, the estimate of 
the time constant of the load machine reaches its real value. The rapid changing of the load 
torque causes the oscillations in the estimate of Tie which are noticeable visible at the time 
f=9s. Still, a such big estimation error can not be accepted in the high performance drive 
system 

In order to improve the control structure performance the following modifications of the 
standard NEKF algorithm improving the quality of the estimation have been implemented. 
Firstly, the estimation of the time constant T2 is active only when the motor speed is 
changing. Secondly, during this time the estimation of the load torque mx is blocked. In the 
NEKF algorithm the last estimated value of the niL is used. Also, when motor speed is not 
changing, the estimate of T 2 is stopped and the estimate of the mi becomes active. During 
this time, the last estimated value of the time constant T2 is utilized in the algorithm. This 
modification allows to increase the values of the co variance matrices of the NEKF. 
All system states are reconstructed well and their estimation errors are very small and do 
not influence the system dynamics negatively (Fig. 9). The time constant of the load machine 
is estimated accurately with a small steady-stay error. The moments when the estimate of 
mie is stopped are visible in the load torque transient (Fig. 9g). Thus, the adaptive system 
with adaptive NEKF work properly. 
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5. Experimental results 

All theoretical considerations have been confirmed experimentally in the laboratory set-up 
composed of a 0.5kW DC-motor driven by a static converter. The motor is coupled to a load 
machine by an elastic shaft (a steel shaft of 5mm diameter and 600mm length). The speed 
and position of the driven and loading motors have been measured by incremental encoders 
(36000 pulses per rotation). The mechanical system has a natural frequency of 
approximately 9.5Hz. The nominal parameters of the system are Ti=203ms, T2=203ms, T c 
=2.6ms. The picture of the experimental set-up is presented in Fig. 10. 




Fig. 10. The mechanical part of the laboratory set-up (a) and the general view of the 
laboratory set-up (b) 
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Fig. 11. Real transients of the: motor and load speeds (a), real and estimated load speeds and 
its estimation error (b), electromagnetic and estimated shaft and load torque (c), estimated 
time constant of the load side (d), control structure parameters (e,f) -for the reference value 
of the speed cor=03 



First the performance of the drive system has been tested for the nominal value of the time 
constant of the load machine T2=0.203s. The electromagnetic torque limit has been set to 2. 
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The system works with the reference value of the speed set to 0.5. According to the 
adaptation procedure described in the previous section during start-up the estimate of the 
rnie is blocked and the estimate of the Ti e is activated which is observable in Fig. llc,d. When 
the control error decreases below 0.05, the estimate Ti e is blocked and the mi e . At the time 
ti=OAs the nominal load torque is applied to the system. This affects the system speed in a 
negative way and some disruption is visible in its transients. The load torque is switched off 
at the time t2=0.8s and the non-zero value of the estimate of the mu comes from the friction 
torques. At the time h=\s the system begins to reverse. When the value of the system speed 
is negative, no external torque is applied to the system. The drive reverses again at the time 
t4=2s and then the work cycle is repeated. Clearly, the adaptive control structure with the 
NEKF works properly. The load speed as well as the time constant of the load machine are 
estimated with small errors. The transients of the control structure parameters are presented 
in Fig. 11 e,f. They vary (except ki) with the estimated value of the Ti e . 

Next the control structure with the electromagnetic torque limit set to 3 has been examined. 
The work cycle is identical as previously. But the reference speed is set to the nominal value. 
The transients of the system are presented in Fig. 12. 

Similarly as before, the initial value of the time constant of the load machine is set to 
T2e=0.1015s. After the start-up it reaches its real value almost without an error. During the 
next reversal the estimate of the T2 oscillates around the real value. However, it should be 
pointed out that the estimation error does not exceed a few percent of the real value. The 
estimate of the T2 is reconstructed very well. Small errors appear in its transient during the 
time when the load torque is switched on and off and during the reversal. The adaptive 
control structure with the state controller works in a stable way. 

6. Conclusion 

In order to damp the torsional vibrations, which could destroy the mechanical coupling 
between the driven and loading machine, the control structure with state controller is 
applied. The control structure coefficients depend on the time constant of the load side 
machine. In the case of the system with changeable load side inertia, there is a need to 
estimate this parameter and adapt the control structure gains in accordance with the actual 
estimated value. The application of the adaptive control structure ensures the required 
transient of the load speed despite the changeable load side inertia. In order to use the 
adaptive control structure, there is a need to choose a state estimator, which has to estimate 
the non-measurable system state variables and changeable parameters of the system. In this 
paper, the non-adaptive and adaptive nonlinear extended Kalman filter (NEKF) is tested. 
Parameters of the covariance matrices Q and R are selected using the genetic algorithm with 
special cost function. The application of the global optimization technique allows to reach 
the global solution according to the defined cost function. However, the application of the 
genetic algorithm is possible only as an off-line process due to a long calculation time. To 
ensure the optimal values of the covariance matrix Q, despite the load side parameter 
changes, the adaptation mechanism is developed. The suitable on-line change of the 
covariance matrix element c\ss is proposed, according to the estimated value of the load side 
time constant. It is proved by simulation and experimental tests that the proposed control 
structure is effective for damping the torsional oscillation of two-mass drive system, also in 
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the case of wide range changes of load side inertia. 
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1. Introduction 

The so-called Sandwich system with hysteresis is a class of systems in which a hysteretic 
subsystem is sandwiched between two smooth dynamic blocks. In engineering, many 
practical processes can be considered as the sandwich systems with hysteresis. In the 
following, two typical examples will be presented. 

1.1 Ultra-precision moving positioning stage 

A typical ultra-precision moving positioning stage is often used in ultra-precision 
manufacturing system for its nanometer displacement and fast linear moving speed. 
Usually, such platform consists of electric amplifiers, piezoelectric actuators and loads. As 
hysteresis is inherent in piezoelectric actuator, the amplifier and load can be considered as 
smooth dynamic subsystems. Therefore, this platform can be considered as a typical 
sandwich system with hysteresis. Fig.l shows the architecture of such system. 




Fig. 1. Architecture of ultra-precision moving stage with piezoelectric actuator 



1.2 Mechanical Transmission System 

Mechanical transmission system often exists in machine tools or many other mechanical 
systems. A typical mechanical transmission system is shown in Fig.2. In this system, the 
servomotor is used to drive a gearbox connected with a mechanical work platform through 
a screw. In this system, u is the servomotor angle, x is the angle of the gearbox, and y is the 
displacement of the work platform. The servomotor and the work platform can be 
considered as smooth dynamic subsystems. However, the gearbox and screw in this system 
is a typical hysteresis nonlinearity due to the tear and wear of the gear teeth. Obviously, this 
mechanical system can be described by the sandwich system with hysteresis. 
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Fig. 2. Mechanical transmission system 



Although, sandwich systems with hysteresis often exist in engineering practice, there are 
only several research reports found on the control of them. Taware & Tao (1999) presented 
an analysis on the control of such systems with backlash-type hysteresis. Tao & Ma (2001) 
proposed an optimal control for the systems with sandwiched backlash. In their methods, an 
optimal control scheme is employed for backlash compensation. Then, the nonlinear 
feedback control law is used for the control of nonlinear dynamics. Zhao & Tan (2006) 
proposed a neural adaptive control for sandwich systems with hysteresis. The neural 
network based hysteresis compensator is developed to compensate for the effect of the 
hysteresis. Furthermore, Zhao et. al. (2007) presented an adaptive control strategy for 
sandwich systems with dynamic hysteresis based on Duhem hysteretic operator. Corradini 
et. al. (2007) proposed a variable structure control of nonlinear uncertain sandwich systems 
with hysteretic block. Therefore, the control of sandwich systems with hysteresis has 
become one of the interesting topics in control engineering domain. 

It is known that the existence of hysteresis in actuators often leads to oscillation and 
undesirable inaccuracy. Therefore, the main purpose of design a control scheme for 
sandwich system with hysteresis is to eliminate the side effect of hysteresis inherent in the 
system and force the system to track the reference trajectory. Note that hysteresis is a non- 
differentiable nonlinear system with multi-valued mapping. Moreover the structure of the 
sandwich system is rather complex. Hence, it is not easy to construct a compensator for the 
hysteresis in such system. Therefore it is a real challenge to develop a control strategy for the 
dynamic systems with sandwiched hysteresis. 

In this chapter, a mathematical description of the sandwich systems with hysteresis will be 
described in section 2. Then, in section 3, the control architecture for the sandwich systems 
with hysteresis will be illustrated. In this architecture, a neural network based inverse model 
is constructed to cancel the effect of the first dynamic block of sandwich system. Then, the 
sandwich system can be transformed to a nonlinear system preceded by hysteresis which 
can be described by a Hammer stein model with hysteresis. In Section 4, a neural network 
based estimator will be developed in terms of a proposed expanded input space with 
hysteretic operator. The developed neural hysteretic estimator can be used to compensate 
for the system residual caused by the effect of hysteresis. Section 5 will present an adaptive 
control strategy based on pseudo inverse control technique for the obtained Hammerstein 
system with hysteresis. One of advantages of the controller is that it does not need to 
construct the hysteresis inverse to cancel hysteretic effect. The neural control strategy and 
the corresponding adaptive law based on the Lyapunov stability theory will be developed. 
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Furthermore, Comparison of the simulation results between the proposed method and the 
PID control strategy will be illustrated in Section 6. Section 7 will present the remarks and 
conclusions of this Chapter. 



2. Mathematical Description of Sandwich Systems with Hysteresis 

The structure of the sanwich system with hysteresis is shown in Fig.3. Suppose the 
nonlinear single-input-single-output (SISO) system with sandwiched hysteresis is described 
by 



L r - f i [v l "\v i "- l \-,v m ,v,r im \r im - l \-,r m ,r] = 



(1) 



where Y is the input, V is the output, V n is the n-th order derivative of V , r {m) is m-th order 
derivative of Y , m and n (m <n) are the orders of the input and output respectively. 



H : u = H(y) 
where H presents the hysteresis nonlinear ity. 



(2) 



L- \ 



(3) 



K =fo( X ) + go( X ) U 



and 



y = x Y 



(4) 



where x = [x { , x 2 , • • • , x n ] is the system state vector, u is the input, y is the output, v is the 
control input and u is the actuator output. It is assumed that f o {%) and g o (jc) are sufficiently 

smooth but unknown functions and satisfy -^- ^ and -^- ^ . Moreover, assume that f o is 
invertible. Notation h[ .] denotes that the hysteresis nonlinearity is not dependent on an 
instantaneous value v (0 but the trajectory, v(t)eC°[0,t]- Assume that all the control and 
input variables, i.e. v {n) , v {n ~ l) , • • • , v (1) , v, r {m) , r (m_1) , • • • , r (1) , r dire known. 



► 



Fig. 3. The structure of sandwich system with hysteresis 
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3. Control Architecture for Sandwich System with Hysteresis 

From Fig. 3, it is known that the architecture of the sandwich system with hysteresis is 
rather complex. It would be convenient for us to design a control strategy for such system if 
we could find a method to simplify the structure of the system. In this section, a control 
architecture for the sandwich system will be discussed. In this architecture, a neural 
networks (NN) based inverse system fj x will be constructed. By connecting the NN based 

inverse with the system L can form an approximate pseudo-linear unit compensator which 

leads to L~ l L. « 1 . Then the sandwich system can be transform to a pseudo-linear unit 

system connected with a nonlinear system preceded with hysteresis nonlinearity which is 
shown in Fig.4. The obtained the system can be considered as a Hammerstein System with 
hysteresis. 
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Fig. 4. The sandwich system with a pseudo-linear unit compensation 

With the above-mentioned NN based inverse, the effect of L. would be cancelled. So we can 
design the controller for the system L o preceded by a hysteresis nonlinearity. Usually, the 
model uncertainty of the neural network based compensator exists. That implies the NN 
based compensator cannot completely compensate for the effect of L ■ Therefore, a model 

residual should be added to system L o . That is L~ l L =l + g, where E, is a bounded modeling 
error. Hence, the obtained system preceded by a hystersis can be described as follows: 



H : u = H(y) , 



(5) 



(6) 



*„=/«,(*) + &,(*)« + £ 



and 



y = x 1 



(7) 



The control objective is to design a control law v (t) to force y(t) , the plant output, to track a 
smooth prescribed trajectory yAt) with an acceptable accuracy. The desired state vector is 



defined as x d (t) = [y d 



>y d >'-y d 



] f where y d n ~ 1} is the ( n -\)th order derivative. Moreover, the 



tracking error vector is defined as 



-x . It is assumed that the desired states are 
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bounded, i.e. Ik | < x d • Moreover, £ denotes bounded disturbance caused by NN based 
inverse, where \g I < ^ , and ^ > • 
Define the filtered tracking error as 



T = [A l ,A 2 '--A n _ l ,l]e = [A T ,l]e 

where A = [A l ,A 2 --- \_ x ] T is a parameter vector to be designed. Suppose 

s n - l +\_ lS n - 2 +--- + \ 
is Hurwitz. Differentiating (8) and using (6), it results in 

t = x lt -y:+[0,A T ]e = f o (x) + g o (x)u-y:+[0,A T ]e + t 



(8) 



(9) 



As u is the output of hysteresis which is usually unknown, an invertible function f(x, v) is 

introduced to approximate f (x) + g (x)u . Adding and subtracting f(x, v) to and from the 
right hand side of (9), it yields 



T = S + f o (x) + g o (x)u-f(x,v)-/; ) +[0,A T ]e + Z 
= S + F(x, u) - f(x, v) - y™ + [0, A r ]e + 1 
= S + f(x,v,u)-y ( ; } +[0,A T ]e + 4 



(10) 



where 8 = f(x, v) is the so called pseudo-control (Calis & Hovakimyan, 2001) and 

(Hovakimyan & Nandi ,2002), F(x, u) = f o (x) + g o (x)u and f(x, v, u) = F(x, u) - f(x, v) is the 

system residual. As f(x, v) is invertible with respect to v and satisfies (Calis & Hovakimyan, 
2001): 



dF du df 

l.sgn = sgn — , 

du dv dv 



(ii) 



and 



dF du 

du dv 



>0. 



(12) 



In order to design the corresponding control strategy, the approximation of the nonlinear 
residual f(x,v,u) is required. Neural networks would be one of the recommended 
alternatives to model this residual. However, f(x,v,u) involves the characteristic of 
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hysteresis, the traditional nonlinear identification methods such as neural modeling 
technique usually cannot be directly applied to the modeling of it since the hysteresis is a 
non-linearity with multi-valued mapping (Adly & Abd-El-Hafiz, 1998). In Section 4, we will 

present a method to construct the neural estimator for f(x, v, u) to compensate for the effect 

of hysteresis. Moreover, a corresponding adaptive control method based on the control 
archieture stated-above will be illustrated in Section 5. 

4. Neural Estimator for System Residual 

In order to approximate the system residual, neural network can be considered as an 
alternative. However, the system residual contains the characteristic of hysteresis which is a 
system with multi-valued mapping. In this section, a hysteretic operator is proposed to 
construct an expanded input space so as to transform the multi-valued mapping of 
hysteresis into a one-to-one mapping (Zhao & Tan, 2008). Thus, the neural networks can be 
used for modeling of hysteresis based on the expanded input space with the hysteretic 
operator. The proposed hysteretic operator is defined as: 

h(x) = (\-e- lx - x ' l )(x-x p ) + h(x p ), (13) 

where x is the current input, h(x) is the current output, x p is the dominant extremum 
adjacent to the current input x . h(x p ) is the output of the operator when the input is x p . 

Lemma 1: Let x(t)eC(R + ) , where j? + ={* |* >0} and C(R + ) are the sets of continuous 
functions on R + . If there exist two time instants t x , t 2 and t x ^t 2 , such that x(t x ) = x(t 2 ), 
x(t x ) and x (t 2 ) are not the extrema, then h^^)]* h[x(t 2 )] . 

Proof: For x(t) decreases or increases monotonically, (13) becomes 

\h(x) = [l-e~ (x ~ Xp) ](x-x n ) + h(x\ jc(f)>0 
h(x) = \ m P P (14) 

[h de (x) = (1 - e x ~ Xp )(x -x p ) + h(x p ), x(t) < 

ti m (x) = e- (x - x > ) .(x-x p )Hl-e- (x - Xp) ] 

= \-[\-(x-x p )]/e x - x > . (15) 

>\-\/e x ~ Xp >0 

Therefore, h in (x) is monotonic. Similarly one can obtain that h de (x) is monotonic. It is noted 
that h in (x) is obtained from h in0 (x) = (l-e~ x )x (x>0) • That means its origin moves 
from (0,0) to (x p ,h(x p )) . Similarly h de (x) is obtained from h deQ (x) = (l-e x )x (x<0) . It 
represents that its origin moves from(o,o) to (x ,h(x )) . As h in0 (-x) = -h de0 (x) , it implies 
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that h in {x) and h de (x) are antisymmetric. Therefore it can be concluded that h in (x) and h de (x) 
intersect only at extrumum point (x ,h(x )) . That is, if x(^) and x(t 2 ) are not the extrema, 
xft) = x(t 2 ), then h[x(t { )] *h[x(t 2 )] ■ 

Remark: If both h(x) and H [•] are fed with the same input v(7) , the curve of /*[v(/)] exhibits 
similarity to that of H[v(t)] such as ascending, turning and descending. Moreover, 
since x(t l ) = x(t 2 ) , x(t x ) and x(t 2 ) are not the extrema, hlxitM^hlxitS] > the pair 
(y(t),h[v(t)]) will uniquely correspond to one of the output values of hysteresis H[v(t)] • 
Lemma 2: If there exist two time instants t x , t 2 and t x *t 2 , such that h[x{tj\-h[x{t 2 )\ -> , then 
xft)-x(^ 2 )^0. 
Proo/: 

»»L VUJ 4 ^;j =L ^ e (0,+oo), (16) 



t(t x )-x(t 2 ) 



and 



c(*i)-*(f 2 ) = 



^0]-^[*(0] ^ (17) 



It is clear that if h in [x(t x )~\-h in [*(* 2 )] -^ , then x (t x ) - x(t 2 ) -> . Similarly, it is obtained that 
if h de [*(*! )] - /^ [jc(r 2 )] ->• , then xit^ -x(t 2 ) -» .Thus, it leads to the following theorem, i.e.: 

Theorem 1: For any hysteresis, there exists a continuous one-to-one mapping T:r 2 -^R, such 
that H[v(t)] = T(v(t),h[v(t)]) , where {v(7),/z[v(7)]} is an expanded input space with 
hysteresis operator. 

Proof: The proof can be divided into two cases, i.e. 

Case 1: If v(t) is not the extrema. Based on Lemmal, if there exist two time instants t x , t 2 

and t x *t 2 , then (v(t 1 ),h[v(t 1 )'^^(y(t 2 ) i h[v(t 2 )'^ • Therefore, the pair (v(t),h[v(t)]) uniquely 

corresponds to an output value of H[v(t)] . 

Case 2: liv(t) is the extrema, then( v (^),Arv(^)l) = (v(t 2 ),h\v(t 2 )\) • According to the principle 



of the classical Preisach modeling, i.e. H[v(t 1 )] = H[v(t 2 )], then the pair r uniquely 

corresponds to an output value of H[v(t)] • 

Combining the above-mentioned two cases, there exists a mapping r : R 2 -» R such that 

H[v(t)] = r(v(t),h[v(t)])- 

In theorem 1, the obtained mapping r (.) is a continuous function. According to Lemma 2, 

from v &)-v(f 2 )->0, it leads to H[v(t x )]-H[v(t 2 )] -> . Also, from h[v{t x ]\-h[v{t 2 )\^0 f it 
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yields v(t x ) - v(t 2 ) -» . Then, it results in H[v(t r )] - H[v(t 2 )] -> . Therefore, it is derived that 

T is a continuous function. Moreover, Theoreml indicates that the multi-valued mapping of 

hysteresis can be transformed to a one-to-one mapping. It can be proved that the obtained 

mapping is a continuous mapping, i.e. 

Let T = [t ,oo) e R , V = {v \T^^>R} . Also let F = {h \ T^^R} be the input sets. Given t t e T □ 

it is obvious that v(t f ) <+ooand/*[v(/.)]<+Qo . So mat( v (^.),/?[v(7.)])e 7? 2 • Thus, it is obtained that 

® = (WO' MHO]) I v ft) e ^ ? MHO] e F} is a compact set . 

Hence, it provides a premise to apply neural networks to modeling of the behavior of 

hysteresis. Based on the proposed expanded input space with hysteretic operator, a neural 

network is used to approximate the system residual, i.e. f(x, v, u) : 

~f{x.v,u) = W T a(V T x m ) + e(x m ) (18) 

where a(-) is activation function, V is the first-to-second layer interconnection weights, 
W is the second-to-third layer interconnection weights, x nn =(x ,S,u) , £ is the NN 
functional reconstruction error, ||^(x^)|| < s N , and s N > . 

The above-mentioned neural network based on the expanded input space with hysteretic 
operator can be used to construct the corresponding neural estimator for the system residual 

f(x, v, u) . Thus, it can be used for the compensation for the effect of the hysteresis inherent 

in the sandwich system. 

5. Adaptive Control Strategy 

In section 3, we introduce an architecture of the control strategy for the sandwich system 
with hysteresis. In the control structure, a neural inverse model is used to compensate for 
the effect of L. in the architecture of the sandwich system with hysteresis. After the 

compensation, the sandwich system with hysteresis is approximately tranformed into a 
Hammerstein system with hysteresis. In this section, an adaptive control strategy is 
developed for the obtained Hammerstein system with hysteresis. 

Assumption 1: If the weight matrices, i.e. V dindW of the neural estimator are respectively 
bounded by V > and W > , i.e. \\w\\ F < W p and ||K| < V p ,where |-| represents Frobenius 
norm. Then, the corresponding pseudo-control can be chosen as 

S = y^-KT-[0,A T ]e-v ad+ v r (19) 

where v r is the term for robust design, K is a design parameter, v ad is the output of neural 
network, i.e. v ad - W T <j(V t x nn ) where W and V are the estimated values of W and V . 
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From (10) and (19), notice that f(x, v, u) depends on v ad through 8 . However, v ad has to 

be designed to cancel the effect of f(x, v, u) . This should assume that the mapping 

8 ad l— > / is a contraction over the entirely interested input domain. It has been proven by 
Hovakimyan and Nandi (2002) that the assumption is held when (11) and (12) are satisfied. 
Using (18) and (19), (10) can be written as 

t = -Kt - W T a{V T x nn ) + W T a{V T x nn ) + v r + s + % . (20) 

Difine 

V = V-V and W = W-W . (21) 

The Taylor series expansion of cr(Vx nn ) for a given X m can be written as 

cr(Vx nn ) = cr(Vx nn ) + cr \Vx nn )Vx nn + o(Vx nn f (23) 

where cr\z) = d<j(z)/dz \ z=s and o(z) is the term of order two. Denoting cr = cr(V x nn ) , 
g - <j{V x nn ) , and & y — g \V x nn ) , with the procedure as Appendix, we have 

i = -KT + W\a-GW T x n J + W T G'V T x nn +v r +s + % + w (24) 

where 

w = W T (g - g) + W t gT t x nn -W T gT t x nn . (25) 

An upper bound for w can be presented as: 

\\w\\ < \\W\1 + 1^1 ||o-T 2 'jc iw I + \\V\\ F \\ x nnW T cr'\\ F (26) 

or 

\\w\\<p w & w (W,V,x m ) (27) 

where 3 W = 1^&'V t xJ^^xJV t <j'J andp w =max(|^| 1 ,|^|,|K| F ) . 

Theorem 2: Let the desired trajectory be bounded. Consider the system represented by (5), (6) 
and (7), if the control law and adaptive law are given by 
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and 



v = f-\x,S) 
S = yf-Kr-[0,A T ]e-v ad+ v r 

W = F[{a-aT T x m )r-kW\r\[ 

V = R[x m W T &'T-kV\\T\\] 

^=rfH($,+i)-*H*] 



-0(3 w +l)y- |r||*0 



(28) 
(29) 

(30) 

(31) 

(32) 



(33) 



0, 



\\t\\*0 



where F = F T > , R = R T > , f > , fZ* = max|> M ,,(£ w +£„)] , and $=<f>-(j>; then the 
signals e , JF , F , and ij) in the closed-loop system are ultimately bounded. 



Proof: Consider the following Lyapunov function candidate, i.e. 

L = -T 2 +-tr(JV T F-'JV) + -tr(V T R-'V) + -<ji> T y-'<j!> 

The derivative of L will be 

L = vi + tr(W T F-'W) + tr(V T R~ l V) + T y~ l 
Substituting (20) into (35), it yields 

L = -KT 2 +rv r +t(w+ s + %) + 4> T y- l <f> + trW T [F~ { W + (<j -&'V T x m )r] 
+ trV T {R~ l V + x m TW T a') 



(34) 



(35) 



(36) 



Substituting W = —W and V = —V into (30) and (31) , (36) can be rewritten as 

L = -Kt 2 + rv r + t(w + s + £) + ^ V 1 ^ + * H [tr(W T W) + /r(F r F)] . (37) 
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Considering (27) and (/) = max[p w ,(£ N + £ N )] ,we obtain 



L < -Kt 2 + rv r + \t\ (f)(3 w + 1) - (jyy- x (j) + k \r\ [tr(W T W) + tr(V T V)] . 
Substituting (32) and (33) into (38), it results in 



L < -Kt 2 + k \\t\\ [tr(W T W) + tr(V T V) + T 0] 



Defining 



(38) 



(39) 



w 





o~ 







V 





' z = 








4> 





w 





o" 







V 





and z = 








*_ 





W 





0" 





V 











$_ 



(39) 



can be rewritten as 



L < -Kr 2 +k\r II tr(Z T Z). 



(40) 



As /r(Z 7 Z)< Z |Z|| F - Z , 
it leads to 



L<-KT l +k\\T\\(\\Z\\ \\Z\\ -\\Z\\ ) . 
II ll v || \\ F II IIf II Wp' 



(41) 



That is 



ii ii l ii ii v || \\ F ^ ' /. J 



(42) 



„ „ A: Z _ n~ 

Thus, Z is negative as lone as either r > — - — — or Z > Z . This demonstrate that r , 

& & II II 4^ || || F II \\f 

W , V , and ^ are ultimately bounded. According to Assumption 1 and the definition of 

r and <j> , we can obtain that the variables e , W , V and ^ in the closed-loop system are 
ultimately bounded. 



6. Simulation 

In order to illustrate that the proposed approach is applicable to nonlinear system with 
sandwiched hysteresis, we consider the following nonlear system: 
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V 9 

L , v = -0.2(sinv-cosv) - + (0.4 sin v cos v+ 0.8V, v(0) = 

1 + v 2 

H , The hysteresis is generated by the sum of TV = 50 backlash operators, i.e. , 

TV 

u = H[v(t)] = y^u f , and 



v(t) v(t)>0, Ui (t) = v(t)-^- 



d. 
u ; = i v(t) v(t) < 0, u i (t) = v(t) + — 

otherwise 



where u i and d i are respectively the output and the dead-band width of i - th backlash 
operator where i = 1,2,--N ( N > is a positive integer) .The values of the dead-band 
widths are evenly distributed within [0.02,1] . All the initial outputs of the operators are set 
to zero. Fig. 5 shows the response of the hysteresis contained in the system. 



I Ji'y 11 JLi /-^9 ■A'l ' Mr 

and 

y = x x . 

The design procedure of the controller for the snadwich system with hysteresis will be 
shown in the following. 

1) Construction of nerual network based L inverse. An artificial neural network unit 
inverse , i.e. L] is constructed to cancel the effect of the first dynamic block, i.e. L. . 
The system is excited by the input r l (t) = sin2t + cost. Then, 500 input/ output 
pairs of data {^(v^v,)} are obtained. Using these data as learning samples, a 

neural network based inverse ZT is constructed. The architecture of neural 

network based inverse model consists of 2 input nodes, 10 hidden neurons and 1 
output neuron. The sigmoid function and linear function are respectively used as 
activation function in the hidden layer and the output layer. The conjugate 
gradient algorithm with Powell-Beale restarted method (Powell, 1977) is used to 
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train the neural network. The compensation result of the NN based L. is shown in 
Fig. 6. It is known that there are some larger error happened in the beginning. Then 
it is gradually reduced as the control proceeded. 




-0.2 -0.15 -0.1 -0.05 O 0.05 0.1 0.15 0.2 0.25 



Fig. 5. The hysteresis in the system 



\ 



\,^^\rY 



Fig. 6. The compensation error of NN based L. 1 



2) Neural approximator of system residual: The neural network used to approximate f(x,v,u) 
consists of 4 input nodes, 35 hidden neurons and 1 output neuron. The input of the NN 

T 1 

is x nn =(x ,S,u) . The activation function is a(x) = 



\ + e x 



3) The selection of the controller parameters: The other parameters of the controller are 
respectively chosen as A l =2,K = U,k = 0.001 , y = 0.1 , f(x, v) = v , F = 8/ , and R = 51 , 

where / is the unit matrix. 

4) PID control for comparison: In order to compare the control performance of the proposed 
control strategy with the PID controller , we choose 

v(t) = —22e x + [ e x dt — I3e 2 
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where e l = y — y d , e 2 — y — y d . Moreover, the desired output of the system 
is y d (t) = 0. l;r[sin It - cos t] . 






(V I N N i 



Fig. 7. The control response of the proposed method 

From Fig. 7, it is known that the control performance of the proposed controller has 
achieved good control response. Also, Fig.8 illustrates the control performance of the PID 
controller. It can be seen that the PID control strategy has led to larger control error when 
the reference signal achieves its local extreme. However, the proposed control strategy 
obtained better control performance. It can obviously derive more accurate control result. 

7. Conclusions 

An adaptive control strategy for nonlinear dynamic systems with sandwich hysteresis is 
presented. In the proposed control scheme, a neural network unit inverse is constructed to 
compensate for the effect of the first smooth dynamic subsystem. Thus, the sandwich system 
with hysteresis can be transformed to a Hammerstein type nonlinear dynamic system 
preceded by hysteresis. Considering the modified structure of the sandwich system, an 
adaptive controller based on the pseduo-control technique is developed. In our method, a 
neural network is used to approximate the system residual based on the proposed expanded 
input space with hysteretic operator. The advantage of this method can avoid constructing 
the hysteresis inverse. Then, the adaptive control law is derived in terms of the Lyapunov 
stability theorem. It has been proved that the ultimate boundedness of the closed-loop 
control error is guaranteed. Simulation results have illustrated that the proposed scheme has 
obtained good control performance. 




Fig. 8. The control response of the PID control method 
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8. Appendix 

From (20), the approximation error can be written as: 

W T <J-W T & 

= W T a-W T a + W T a -W T & (Al) 

= W T CT + W T (cr-a) 

Substituting (23) into (Al), it yields 

w T <j-w T a 

= w T {a+a<P x nn+0 {v T xj 2 ) + w T {a<P x nn+0 {P x nn f) 

= w T a+w T aT T x nn + w T aw T x nn +w T o(y T x nn f (A2) 

= w T a + w T a ' v T x nn - w T a • v T x m + w T a • v T x m + w T o(v T x nn f 

= w T (a-a'v T x m )+w T a'v T x m +w T a } v T x nn +w T o{v T x nn f . 

Defining w = W T a y V T x nn + W T o{V T x nn f , (A2) becomes 

w T <j-w T a 

= w T {a -aw T x nn )+w T a*v T x nn +w . 

So that 

w = w t g - w T a - w T (a - a • F x nn ) - # r 6- • v T x nn 
= w T a-w T a+w T a'Px nn -w T aw T x nn 
= w T {g -a)^w T a y v T x m -w T a y v T x m -w T a y v T x nn 
= w T (g -a)+w T a*v T x nn -w T a y v T x nn 
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1. Introduction 

The traffic volume of the data transmission is increasing each year with the explosive 
growth of the Internet. The networking technologies supporting the data transmission are 
optical fiber transmission technologies. In the physical layer, the networks are classified into 
three networks, the long-haul network that connects city to city, the metropolitan area 
network that connects the central station in the city to the neighboring base station, and the 
access network that connects the base station to the home. In order to adapt to the increase 
of the data transmission, we need to achieve high-speed transmission and increase the 
capacity of transmission in each network. 

In the access network, many kinds of passive optical networks (PON) are studied to offer a 
high-speed access to the Internet at low cost. In the metropolitan area network, we 
contemplate the update of the network structure from the conventional peer-to-peer 
transmission to the ring or mesh structure for the high-capacity and highly reliable networks. 
In the long-haul network, the study on multilevel modulation such as the differential 
quadrature phase shift keying (DQPSK) is a recent popular topic for the high-capacity 
transmission because the multilevel modulation utilizing the phase information offers high- 
speed transmission without increasing the symbol rate. Other modulation and multiplexing 
technologies are also studied for the high-capacity networks. The orthogonal frequency 
division multiplexing (OFDM) is one of the wavelength division multiplexing methods and 
achieves high spectral efficiency by the use of orthogonal carrier frequencies. The optical 
code division multiple access (OCDMA) is a multiplexing technique in the code domain. 
These techniques are developed in the wireless communication and modified for the optical 
transmission technologies in these days. 

In the long-haul and the metropolitan area networks whose transmission distance is over 10 
km in 40 Gb/s, chromatic dispersion (CD) is one of the main factors which limits the 
transmission speed and the advances of the network structure. The CD is a physical 
phenomenon that the group velocity of light in the fiber depends on its wavelength 
(Agrawal, 2002). The CD causes the degradation of the transmission quality as the optical 
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signals having a spectral range are distorted by the difference of the transmission speed in 
the wavelength domain. The effect of dispersion increases at a rate proportional to the 
square of the bit-rate. 

In the high-speed optical transmission over 40 Gb/s, we have to compensate for the CD 
variation caused by the change of strain and temperature adaptively in addition to the 
conventional static CD compensation because the dispersion tolerance is very small in such 
a high-speed transmission. Also, in metropolitan area networks employing reconfigurable 
networking technology such as the mesh or ring network, the transmission route changes 
adaptively depending on the state of traffic and the network failure. As the CD value 
depends on the length of the transmission fiber, we have to compensate for the relatively 
large CD variation caused by the change of the transmission distance. 

With the aforementioned background, many researches and demonstrations have been 
conducted in the field of the adaptive CD compensation since around 2000 (Ooi et al., 2002; 
Yagi et al., 2004). The adaptive compensations are classified into two major groups, the 
optical compensations and the electrical compensations. In the electrical compensation, we 
utilize the waveform equalizer such as the decision feedback equalizer (DFE), the feed 
forward equalizer (FFE) or the maximum likelihood sequence equalizer (MLSE) after 
detection (Katz et al., 2006). These equalizers are effective for the adaptive CD compensation 
because they act as a waveform reshaping. The compensation based on DEF and FFE has 
advantages that the equalization circuit is compact and implemented at low cost. However, 
the compensation range is limited because the phase information of the received signal is 
lost by the direct detection. The MLSE scheme is very effective in 10 Gb/s transmission. 
However it is difficult to upgrade high bit-rate over 40 Gb/s because the scheme requires 
high-speed A/D converter in implementation. 

In the optical domain, the adaptive CD compensation is achieved by the iterative feedback 
control of a tunable CD compensator with a real-time CD monitoring method as shown in 
Fig. 1. Many types of tunable CD compensators are researched and developed recently. The 
tunable CD compensator is implemented by the devices generating arbitral CD value. Also, 
many kinds of CD monitoring methods are studied and demonstrated for the feedback 
control of tunable CD compensators. While the compensation devices and the dispersion 
monitoring methods are studied with keen interest, the adaptive control algorithm, how to 
control the tunable CD compensator efficiently, has not been fully studied yet in the optical 
domain CD compensation. When the tunable CD compensator is controlled iteratively for 
the adaptive CD compensation, the control algorithm affects the speed of the compensation 
to a great degree as well as the response time of the compensation devices and the 
monitorings. Although the simple hill-climbing method and the Newton method are 
employed as a control algorithm in many researches and demonstrations, these algorithms 
are not always the best control algorithm for the adaptive CD compensation. 
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Fig. 1. Adaptive CD compensation in the receiver. 
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In this chapter, we report the adaptive CD compensation employing adaptive control 
technique in optical fiber communications. We propose a high-speed and low cost adaptive 
control algorithm based on the steepest descent method (SDM) for feedback control of the 
tunable CD compensator. The steepest descent approach has an ability to decrease the 
iteration number for the convergence. We conducted transmission simulations for the 
evaluation of the proposed adaptive control technique, and the simulation results show that 
the proposed technique achieves high-speed compensation of the CD variation caused by 
the change of the transmission distance in 40 Gb/s transmission. 

The organization of this chapter is as follows. In Section 2, we explain the fundamentals of 
CD and adaptive CD compensation in optical fiber communications for the background 
knowledge of this research. Then we propose the adaptive control technique based on the 
SDM for adaptive CD compensation in Section 3. In Section 4, we show the demonstrations 
and performance analysis of the proposed technique in 40 Gb/s transmission by simulations. 
Finally, we summarize and conclude this paper in Section 5. 

2. Chromatic Dispersion in Optical Fiber Communications 

2.1 Fundamental of chromatic dispersion 

The group velocity of the light depends on its wavelength when the light is propagating in 
mediums. This phenomenon is called CD or group velocity dispersion (GVD). In optical 
communications utilizing the optical fiber as a transmission medium, the optical pulse is 
affected by the CD as the propagation time depends on the constituent wavelength of the 
optical pulse as shown in Fig. 2. The CD has two contributions, material dispersion and 
waveguide dispersion in a single mode fiber (SMF). The material dispersion is attributed to 
the characteristics of silica that the refractive index changes with the optical wavelength. The 
waveguide dispersion is caused by the structure of optical fiber, the core radius and the 
index difference. 

Considering optical propagation in the fiber, the propagation constant ft is a function of the 
angular frequency co and expanded by Taylor expansion as follows. 

1 o 1 , 

ft(a)) = ft +ft l (a)-a) ) + -ft 2 (a)-a) ) + -ft 3 (a)-a) ) +••• (1) 

1 

Here, ox) is a center angular frequency, and fa, ft, pi, and ft are Taylor's coefficients. The 
time required for the propagation of unit length r is obtained by differentiating partially the 
propagation constant p as follows. 

1 - 

T(o)) = p l +fi 2 (co-co ) + -P 3 (cD-co ) +••• (2) 

It is confirmed from (2) that the required time is angular frequency dependent; the 
propagation time of optical pulse depends on the wavelength in optical communications. 
The coefficients ft and ft are first-order and second-order constants indicating the degree of 
the angular frequency dependence, respectively. Assuming that the second-order CD is 
negligible, the CD parameter is defined as 
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Fig. 2. Schematic diagram of chromatic dispersion. 
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(3) 



where c is the speed of light. The unit of the CD parameter is ps/nm/km. 

In SMF, the CD parameter is zero at around 1300 nm and about 20 ps/nm/km at the typical 

wavelength used for optical communications, around 1550 nm. We have many 

characteristics of optical fibers such as dispersion shifted fiber (DSF) whose CD parameter is 

zero at around 1550 nm for the reduction of CD effect in optical fiber communications, and 

dispersion compensating fibers (DCF) whose CD parameter is minus value for the purpose 

of static CD compensation. 

In optical fiber communications, the optical pulse is affected by the CD as it has relatively 

wide spectral range corresponding to the bit-rate. Assuming that the optical pulse is a 

Gaussian waveform for the simplicity, the waveform in time-domain is expressed as 



£/(0,r) = exp 



r T i a 

Zi o ; 



(4) 



where To is a full width at half maximum (FWHM) of the pulse. When the pulse is 
transmitted for arbitral distance z, the waveform is affected by the CD and distorted as 



U(z,T) = - 



(T 2 -jj3 2 zy 



-exp 



2(T 2 -jfi 2 z) 



(5) 
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Fig. 3. Optical pulses affected by chromatic dispersion. 
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where we neglect the second-order CD for simplicity as the first-order CD is dominant. 
Figure 3 shows the waveforms of optical pulse when we change the product of ft. and z 
under the condition that To=100 ps. The larger the product of pi and z is, the wider the 
FWHM of the transmitted waveform is; the effect of CD is larger in the case that the 
transmission distance is longer and the CD parameter is larger. If the FWHM of the optical 
pulse gets wider, the possibility of the inter symbol interference (ISI) is higher as shown in 
Fig. 4. As the ISI causes code error in optical communications, the transmission distance is 
limited by the CD. Also, the maximum transmission distance is reduced according to the bit 
rate of the transmission B because the FWHM of the optical pulse To is decreased when the 
bit rate increases. We can also understand it from the fact that the spectral width is wide in 
short optical pulse. The effect of CD on the bit rate B can be estimated and the CD tolerance 
Dt, the limitation of CD that the quality of the transmission is assured, is expressed as 
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\D T \<- 



BzAA 



(6) 



where AA is the range of wavelength in the optical pulse. The CD tolerance is inversely 
proportional to the bit rate and the transmission distance and the wavelength range of the 
input pulse. 

2.2 Adaptive chromatic dispersion compensation 

As mentioned in Section 1, the adaptive CD compensation is an essential technology for 
high-speed optical fiber communications as the CD tolerance is very small in the systems 
whose transmission speed is over 40 Gb/s. Many researches have been conducted for the 
adaptive CD compensation in optical communications. The principle of the CD 
compensation is very simple as shown in Fig. 5. We can achieve the CD compensation by 
placing a transmission medium which has the inverse CD value of the transmission fiber in 
the transmission line. The adaptive CD compensation is achieved by changing the 
compensating CD value adaptively according to the CD in the transmission fiber. The 
conventional setup of the adaptive CD compensation is shown in Fig. 1; the tunable CD 
compensator is feedback controlled with the real-time CD monitoring. In this section, 
tunable CD compensators and CD monitoring techniques are briefly introduced for the 
background information of the adaptive control algorithm to be proposed. 
We have many types of tunable CD compensators for the adaptive compensation. They are 
basically implemented by the dispersive medium with the function of tunability, for 
example, chirped fiber Bragg grating (CFBG) with heater elements (Matsumoto et al., 2001; 
Eggleton et al., 2000), micro-electro mechanical system (MEMS) (Sano et al., 2003), ring 
resonator (Takahashi et al., 2006), and so on. We adopt a virtually imaged phased array 
(VIPA) compensator in the following research. The VIPA compensator is a tunable CD 
compensator, which is consisted of the combination of a VIPA plate and a three dimensional 
adjustable surface mirror (Shirasaki, 1997; OOi et al., 2002). The VIPA plate operates as a 
grating, and the specific spectral components of light is reflected by the mirror to induce CD. 
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Fig. 5. Principle of chromatic dispersion compensation. 
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method 


Good 


Good 


Fair 


Good 


Not 
required 


Clock phase 
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Good 


Fair 


Good 


Poor 


Not 
required 


Eye-diagram 


Good 


Fair 


Excellent 


Good 


Not 
required 


BER 


Fair 


Poor 


Excellent 


Good 


Required 



Table 1. Performances of feedback signals in adaptive CD compensation 

We can generate arbitral CD value as the change of the geometry of the three dimensional 
mirror. In the VIPA compensator, wide compensation range, ±1800 ps/nm in 10 and 40 
Gb/s, is achieved by the appropriate design of the three dimensional mirror. 
Also, many kinds of CD monitoring methods are studied and demonstrated for the feedback 
control of the tunable CD compensators. The typical monitoring signals are bit error rate 
(BER), eye-diagram, clock power level (Sano et al., 2001), and phase difference of clock 
signals (Qian et al., 2002). We show the performance comparison of the feedback signals for 
adaptive control of the tunable CD compensator in Table 1. The requirement of pilot signal 
is the disadvantage for the BER as the monitoring signal. If we consider each characteristic 
of the feedback signal, the extracted-clock power level or the eye-diagram is better for the 
feedback signal in adaptive CD compensation. We adopt the eye-opening value obtained 
from the eye-diagram as the feedback signal in the adaptive control method to be proposed. 

3. High-Speed Adaptive Control Method Based on Steepest Descent Method 

In this section, we propose a method of high-speed adaptive control of tunable CD 
compensator for adaptive CD compensation. We apply the steepest descent method to the 
adaptive control algorithm in order to reduce the compensation time. The approximation of 
partial derivative for the steepest descent approach is proposed and applied to the control of 
the VIPA compensator. 



3.1 Steepest descent-based control algorithm for adaptive chromatic dispersion 
compensation 

The adaptive control system must be low cost, high-speed, and applicable to wide 
dispersion ranges for the adaptive CD compensation in optical communications. Most 
control systems require high-cost measuring instruments for the CD monitoring. We 
therefore propose the feedback control method that does not require high-cost CD 
monitoring. In our proposal, the feedback signal is a received waveform in the time domain. 
The tunable CD compensator is controlled repeatedly to reshape the waveform. The 
measurement of the waveform is relatively easy and uninfluential in the transmission 
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conditions such as pilot-signal requirements. Conventional feedback control is based on the 
hill-climbing method, which requires a lot of time for optimization. We have therefore 
applied the steepest descent method to the feedback control for high-speed compensation. 
Figure 6 shows an optical dynamic routing network with the adaptive CD compensation. 
Transmitted signals are passed through a route that is chosen arbitrarily among optical 
paths, being affected by the CD. In the receiver part, the degraded signals are fed into the 
tunable CD compensator and the dispersion is compensated. The adaptive dispersion 
compensation is achieved by the combination of a tunable CD compensator and a controller. 
The compensated signals are received by a photodiode and demodulated. 
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Fig. 6. Schematic diagram of all-optical dynamic routing network with the adaptive 
dispersion compensation technique. 
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The tunable compensator is controlled by our proposed adaptive control method based on 
the steepest descent method. The proposed procedure of the controller is shown in Fig. 7, 
where P ou t and P re f are the eye-opening values (normalized as P re f = 1) of the received and 
reference signals, f out and f re f respectively. In this method, we measure and register the 
reference signal, f re f which is a received signal unaffected by the CD. The reference signal is 
determined from the characteristics of the transmitter-receiver set. Therefore, we can copy 
the reference signal to other receivers after it has been measured once. 

The first step is a calculation of an error value: Er. The error value is defined as the 
difference between the eye-opening values, P ou t and P re f. 

Er = ^(P ref -P out ) 2 (7) 

The next step is a calculation of partial derivatives of Er in terms of the control parameters, 
Xi (z=l,2,. . ., n), for the update based on the steepest descent method. 

f^ = (^"^)f- (8) 

ox i ox t 

We need to measure small changes in P out when X{ changes slightly in order to get the 
accurate partial derivatives of P ou t with respect to X{. However, this is unrealistic as it takes a 
lot of time for the measurement and our goal is to achieve quick CD compensation. 
Therefore, we approximate the partial derivatives of P ou t with respect to x\. The 
approximation is to be mentioned at the next subsection. 
In the final step, the control parameters are update as 

*,-=>*,--*^— ( 9 ) 

ox i 

where s is an appropriate constant concerning the speed and accuracy of the convergence. 
We repeat this procedure until the transmission quality becomes optimal. The required 
number of update iterations is fewer than that of the normal feedback control based on the 
hill-climbing method due to the steepest descent approach. In practical all-optical dynamic 
routing networks, the procedure is repeated all the time as the transmission route changes at 
frequent intervals. 

3.2 Approximation of partial derivatives for steepest descent approach 

To approximate the partial derivatives of P ou t with respect to Xi, we need to know the change 
in one-bit waveforms of the received signal, w ou t(t), caused by the change in x\. When we 
assume that the waveform of the transmitted signal is a Gaussian-like pulse (the peak level 
is unity) just like in the approximation in return-to-zero transmissions and that the 
transmission is affected only by the CD, the waveform, w ou t(t) is calculated analytically in 
terms of the CD values of the transmission fiber, Dp er ps/nm and TDC, Dtdc ps/nm, as 
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w out(t) = v peak exp 



2.2 A 



peak 



(10) 
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where Tfwhm is the FWHM of the transmitted signal, X is the center wavelength , t is time, 
and c is the speed of light. The partial derivative of w out (t) with respect to %\ is calculated 
from (10) and (11). 
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(12) 



Equation (10) shows that the value v pea k is the peak level of w ou t(t). We can measure it in a 
practical system. Therefore, (12) shows that we can obtain the approximated partial 
derivative of w ou t(t) with respect to x\ because Tfwhm and X are known parameters. We obtain 
the partial derivative of the peak value in w out (t) by substituting for t 
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(13) 



The value of v pea k corresponds to the eye-opening value in nonreturn-to-zero (NRZ) 
transmission approximately. Therefore, the partial derivative of P out with respect to X{ is 
approximated as follows. 



dP 
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X 2 dD TD 

2tx dx, 



(14) 



3.3 Detailed control algorithm for VIPA compensator 

In the simulations described in the next section, we employ a VIPA compensator as the 
tunable CD compensator. The VIPA compensator has a single control parameter, i.e. CD S 
ps/nm. The detailed control procedure of the VIPA compensator is as follows. In general, 
we can apply the proposed method to any kind of tunable CD compensators. 

(i) Initialize the parameter of the VIPA compensator: S ps/nm 



S = ps/nm 



(15) 



(ii) Calculate the error value: Er 
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The error value, Er, is derived from (7). 

If Pout is zero, we go to (iii), otherwise to (iv). 

(iii) Update S by the hill-climbing method 

S^S-AS (16) 

where AS ps/nm is an appropriate small constant. We then go on to (v). 

(iv) Update S by the steepest descent method 

We calculate the partial derivative of Er from (8) and (14). 

The partial derivative of P ou t with respect to S is approximated as 



(17) 
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The parameter S is updated as 




8S 





(18) 



where ^is an appropriate constant. We then go to (v). 
(v) Judge the error value: Er 

We calculate Er again by using (7). If Er increases or becomes small enough, the 
procedure stops. Otherwise, we go back to (ii) and repeat the same process. 
However, in practical all-optical dynamic routing networks, the compensation 
process is repeated all the time as the dispersion value changes frequently. 

4. Transmission Simulations at 40 Gb/s 

4.1 Simulation results in NRZ-OOK transmission at 40 Gb/s 

Numerical transmission simulations using OptiSystem were conducted to verify the 
application of the proposed technique to 40 Gb/s optical fiber transmission system. In the 
proposed control method, we have to set the constants for search, s and AS, appropriately. 
They were adjusted for the 40 Gb/s transmission and set at 3><10 5 and 30, respectively. The 
output power of a distributed feedback laser diode (DFB-LD) at the transmitter was dBm. 
We supposed that the modulation format were NRZ-OOK. The central wavelength of the 
transmitted signal was 1550 nm. The transmission path was a non-zero dispersion shifted 
fiber (NZ-DSF). We assumed that CD, polarization mode dispersion (PMD), self -phase 
modulation (SPM), and other nonlinearity affect the transmitted signal. The power loss was 
amplified to dBm by an erbium-doped fiber amplifier (EDFA) after both of the fiber 
transmission and the dispersion compensation by the VIPA compensator. The EDFA, the 
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Fig. 8. BERs at every update of the compensator (a) 0->20km, (b) 20->25km. 
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Fig. 9. Eye-diagrams at every update of the compensator (a) 0->20km, (b) 20->25km. 



receiver, and other optical components were assumed to have moderate levels of noise. 
Transmission simulation results are shown in Fig. 8 and Fig. 9. In these simulations, the 
initial value of S was set at 0. Fig. 8 shows the BERs at every update of the VIPA 
compensator when the transmission distance changes (a) from to 20 km (dispersion: -> 
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100 ps/nm) and (b) from 20 to 25 km (dispersion: 100 -> 125 ps/nm). The compensation 
improved the BER (a) from 1.0x10° to l.OxlO- 15 and (b) from 7.41*10-3 to l.OxlO- 15 , 
respectively. As shown in Fig. 9, the eye-diagrams measured by a sampling oscilloscope 
were found reshaped. These results show that the CD compensation with the proposed 
control method improve the transmission quality. 

The update iteration number to achieve a sufficiently low BER (<10 9 ) were four and two, 
respectively in these two cases. A single calculation of the next dispersion value requires less 
than 10 ms. The response time of the VIPA compensator is 2 ms for every 1 ps/nm 
compensation. Therefore, the time required for ±400 ps/nm CD compensation is about 1 s, 
which is determined practically by the response time of the VIPA compensator. This 
technique achieves a high-speed adaptive control of a tunable CD compensator in 40 Gb/s 
transmission since the update iteration number is small and the calculation time with the 
proposed approximation is short enough. The proposed technique is more effective if the 
response time of the tunable CD compensator is faster as the required iteration number is 
decreased by our proposed adaptive control technique based on the steepest descent 
method. 



4.2 Compensation range and required iteration number 

Figure 10 shows the compensation range of the proposed method at 40 Gb/s. We measured 
BERs before and after compensation when the CD value changed from ps/nm to arbitrary 
value. The compensation range in which the BER after compensation is less than 10~ 9 is 
about from -450 to 450 ps/nm, corresponding to a NZ-DSF path-length change of about ±90 
km. This range is wide enough for compensating the change of CD caused by dynamic 
routing. In this wide compensation range, the iteration number required for error free 
transmission (BER<10 9 ) is less than 15. The fast adaptive CD compensation is also achieved 
by the proposed adaptive control technique as the required iteration number is small. 
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5. Conclusion 

In this chapter, we have proposed high-speed adaptive CD compensation with the adaptive 
control method based on the steepest descent method and reported the performances 
evaluated by numerical simulations. 

The simulation results show that the proposed control method based on the steepest descent 
method controls the tunable CD compensator quickly and effectively for a wide dispersion 
range in 40 Gb/s transmission. The range is up to ±450 ps/nm, and the required 
compensation time is about 1 s for the CD variation within ±400 ps/nm at 40 Gb/s. These 
achievements are valuable for the future optical networks employing dynamic routing 
technique. 
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1. Introduction 

Hysteresis phenomenon occurs in all smart material-based sensors and actuators, such as 
shape memory alloys, piezoceramics and magnetostrictive actuators (Bank & Smith, 2000; 
Tan & Baras, 2004). In order to study this phenomenon, different models were proposed 
(Brokate & Sprekels, 1996; Visintin, 1994). Normally, hysteresis models are classified into 
two categories, physics-based model such as Jiles-Atherton model (Jiles & Atherton, 1986) 
and phenomenology-based model such as Preisach operator (Brokate & Sprekels, 1996; 
Visintin, 1994) and Duhem model (Visintin, 1994). From control systems point of view, 
hysteresis is generally non-differentiable, nonlinear, and unknown. As a result, systems 
with hysteresis usually exhibit undesirable inaccuracies or oscillation and even instability. 
Mitigating the effect of hysteresis becomes necessary and important, thus it has received 
increasing attention in recent years (Tao & Kokotovic, 1995, Su, et al, 2000, Su, et al, 2005). 
Many of these studies are related to modeling of hysteresis and their control issues. 
With the development of artificial intelligent (Al), Al is being applied to dealing with 
nonlinearities in systems (Ge & Wang, 2002). Only a few studies have been carried out by 
using NN to tackle hysteresis modeling and compensation (Makaveev, et al, 2002; 
Kuczmann & Ivanyi, 2002; Beuschel et al, 1998; Zhao & Tan, 2006). In the paper (Makaveev, 
et al, 2002), a NN model is used to describe the hysteresis behavior in different frequencies 
with the knowledge of some properties of magnetic materials, such as loss separation 
property to allow the separate treatment of quasi-static and dynamic hysteretic effects. 
Beuschel et al used (Beuschel et al, 1998) a modified Luenberger observer and NN are used 
to identify a general model of hysteresis. These researches demonstrate that NN can work as 
an unknown function approximator to describe the characteristics of hysteresis. Recently, 
two papers (Zhao & Tan, 2006; Lin et al 2006) applied the approximation property of NN to 
coping with the identification of Preisach-type hysteresis in piezoelectric actuator, and the 
hysteresis estimation problem for piezo-positioning mechanism based on hysteresis friction 
force function, respectively. It should be noted that the aforementioned results share a 
common assumption that the output of hysteresis is measurable. 

In practical systems, smart actuators are integrated into the systems, which makes the 
measurement of output of hysteresis hard. Hence it is a challenge to design an observer for 
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the unavailable output of hysteresis. Due to the unavailability of the output of hysteresis, 
the major obstacle of pre-inversion compensator for hysteresis is the lack of effective 
observer design methods for piezoelectric actuators. Especially, the traditional "Luenberger- 
type" nonlinear observer design (Krener & Isidori, 1983) or the "high-gain" observer 
(Krener & Kang, 2003) cannot be applied directly, since the hysteresis is highly nonlinear. 
The sliding-mode observer was developed to estimate the internal friction states of LuGre 
model for the servo actuators with friction (Xie, 2007). This observer needs a low-pass filter 
to remove the high-frequency components in the estimated state variable, which is not 
applicable in this paper. Yang and Lin (Yang & Lin, 2004) proposed homogeneous 
observers design for a class of n-dimensional inherently nonlinear systems whose Jacobian 
linearization is neither controllable nor observable. 

Inspired by NN's universal approximation property, and the aforementioned facts in 
observer design, we propose an observer-based adaptive control of piezoelectric actuators 
with unknown hysteresis in this paper. The main contribution of this paper is the following: 
First, it applies the NN to on-line approximate complicated piecewise continuous unknown 
nonlinear functions in the explicit solution to Duhem model. Second, an observer is 
designed to estimate the output of hysteresis of piezoelectric actuator based on the system 
input and output. Third, the stability of the controlled piezoelectric actuator with the 
observer is guaranteed by using Lyapunov extension (Kuczmann & Ivanyi, 2002). 
The organization of the paper is as follows. In Section II, a Duhem model of hysteresis and 
the problem statement are given. The main results on NN-based compensator for hysteresis 
are presented in Section III. Section IV provides an example to show the feasibility of the 
proposed method. Conclusions are given in Section V. 

2. Preliminaries 

2.1 Duhem model of hysteresis 

Many different mathematic models are built to describe the hysteresis behavior, such as 
Preisach model, Prandtl-Ishlinkii model and Duhem model (Coleman & Hodgdon, 1987; 
Macki et al, 1993). Considering its capability of providing a finite-dimensional differential 
model of hysteresis, we adopt classical Duhem model to develop the adaptive controller for 
the piezoelectric actuator. 

The Duhem model is a rate independent operator, with input signal v , v and output 
signal t . The Duhem model describes hysteresis H(t) by the following mathematical 
model (Coleman & Hodgdon, 1987; Macki et al, 1993). 



d t 

~dT 



dv 



dt 



[fM-T]+~g(v) W 

dt 



where a is a positive number, f(y) and g(v) are prescribed real- valued functions on 
(-oo,+oo) . 
It can also be represented as (Coleman & Hodgdon, 1987; Macki et al, 1993): 

dr \ a -[/(v)-r] + g(v), v>0 ^) 



dv [-a -[f(v)-r]+ g(v), v < 
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where a is the same positive number in (1), g(v) is the slope of the model, and f(v) is the 
average value of the difference between upward side and downward side. 
Property 1 (Coleman & Hodgdon, 1987; Macki et al, 1993): f(y) is a piecewise smooth, 

monotone increasing, odd function with a derivative f'(v) , which is not identically zero. For 
large value of input v(t) , there exists a finite limit f'(po) ; 



/(v) = -/(-v), lim/'(v)« 



(3) 



Property 2 (Coleman & Hodgdon, 1987; Macki et al, 1993): g(v) is a piecewise continuous, 
even function with 



g(v) = g(-v), limg'(v)<°° 



(4) 



It has been shown that Duhem model can describe a large class of hysteresis in various 
smart materials, such as ferromagnetically soft material, or piezoelectric actuator by 
appropriately choosing /(v)and g(v) (Coleman & Hodgdon, 1987; Macki et al, 1993). One 

widely used pair of functions of f(y) and g(v) are 



a r v s +a 2 (v-v s ) f 0} 



v >v c 



f(y) = \ a v v far |v|<v, 

\-a l -v s + a 2 (v + v s ) for v < —v s 



(5), 



g(v) = a 3 

where v^>0, a x > , a 2 > , \>a 3 >0, a x and a 2 satisfy a x ,a 2 e [a min a 
a mSLX are known constants. Substituting the f(y) and g(v) into (2), we have 



c]/ ^n 



(6) 
and 



«• vfq • v^ +ci2(y-v s ) -r] +<2 3 • v 

a-^r-a^+a^-v 
a- i[q • v 5 -^(v+v 5 ) +r] +<2 3 • v 



v>Vy,v>0 

0<v<Vy,v>0 

-v s <v<0,v<0 

v<-v„,v<0 



(7) 



The above equation can be solved for 



a 2 -v-f 2l 
a v v- f 22 
a r v-f 23 
a 2 -v- f 24 



v > v s , v > 

0<v<v 5 ,v>0 

-v s < v < 0,v < 

v < -v c ,v < 



(8) 
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with 



fa = (<h ' v o " T o) ■ e ~ a(V ~ V ° ) " e ~ aV [ o (<h ~ a i) ■ e " <d ^ ~ (°i ~ a i) ■ v s v>V V>0 
fa = («i • v o " h) ■ e~ a(v ~ Vo) ~ e~ av [ o i<h - «l) ■ e " <d C 0<V<^,V>0 

f 23 =(o, • v -r )-e a(v - v » ) -e" v £(a, -o,)-^^ _ v ^ < V <Qv<0 

/ 24 =ifh • v -r )-e a(v " Vo) -e av £(«3 -a 2 )-e" a< ^+(^ -a 2 )-v s V<-^,V<0 



(9) 



In order to describe the piezoelectric actuator, we choose the same functions f(y) and g(v) 
as those in (Banning, et al, 2001), which is a special case of the foregoing choice of f(y) 
and g{y) , i.e. a x - a , a 2 = and a 3 =b when v < v s . 



/(v) = 



a-v s for v>v s 

a ■ v for |v| <v s 

-a-v s for v<-v s 



(10) 



g(v) = 



for v >v s 
b for Ivl <v 5 
ybr v < -v 5 



(ii) 



where v^> , a>0 , Z? > and a>b>a/2 . Suppose the parameter a 
satisfies a e [a min # max J, a min anc * tf max are known constants. 



Substituting (5) and (6) into (1), we have 

a-i[a-v s -r] 
a-\[a-v-r]+b-v 
a-i[r-a-v\+b-v 

a-\[a-v s +r] 



v>v s ,v>0 

0<v<v s ,v>0 

-v s < v<0,v<0 

v<-v^,v<0 



(12) 



Equation (7) can be solved for r 



-/21 


v>v s ,v>0 


a-v-f 22 


0<v<v s ,v>0 


a-v-f 23 


-v s < v<0,v<0 


~f24 


v<-v^,v<0 



(13) 



Adaptive Control of Piezoelectric Actuators with Unknown Hysteresis 



263 



where 



/21=-^^ V0)l ~ aV s V>V s ,V>0 

/ 22 K^v -r )-^^ ) -^ v -[(^-a).^^0<v<v,,v>0 



dy-v Q ) -av 



/23=(«-v -r )- e ^' -e^ ■ [$-a)-e*dS Vj " V< .' 

•% v<-v s ,v 

f 24 =-T e^Kav s 



c ^-v s <v<0,v<0 
<0 



Equation (8) can be also expressed as: 

T=a'Z\ 'V-tf 2 \'%l Z3 +fll'& 'Z3 +fl3'Zl 'fo+fl4'Z2 ' Zd 

where % t (i = 1,2 • • • 4) are indicator functions defined as: 



W>v/ 



|t|<v, 

1 H>v/ 



1 v>0 
v<0' 



v>0 

1 v<0' 



Following the definition of the indicator functions, we get 

Z\'Zi=^ Z1+Z2 =1, &'Z\=*\ Z\+Z2= l > &=%vk=\23,4 
By defining % x - % 2 — / we have 

T=a'Xvv^ 2V Z2'Z3+k2'ZvZ3 + k3'ZvZA^f2A'ZrZd 
Let 

F2=k\'Z2'Z3+k2'Z\'Z3+k3'Z\'^f2A'Z2'ZA 

and K a = a%\ • We can also write the derivative of r as 

i = K„v - F 



(14) 



(15) 



(16) 



2.2 Augmented Multilayer Perceptron (MLP) Neural Network 

The MLP NN has been explored to approximate any function with arbitrary degree of 
accuracy (Hornik et al, 1989). However, it needs a large number of NN nodes and training 
iterations to approximate non-smooth functions (i.e. piecewise continuous), such as friction, 
hysteresis, backlash and other hard nonlinear ities. For these piecewise continuous functions, 
the MLP needs to be augmented to work as a function approximator. Results for 
approximation of piecewise continuous functions or functions with jumps are given in the 
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paper (Selmic & Lewis, 2000). We use the augmented NN to approximate the piecewise 
continuous function in hysteresis model. 

Let ^ be a compact set of R n and define C n (S) be the space such that the 

map f(x) : S — » R n is piecewise continuous. The NN can approximate a 

function /(x) e C n (S) , x e R n , which has a jump at x = c and is continuous from the right 
as 

f( x ) = W T • cr(V T • x) + W f T -(p[V f T • - c)] + s{x) (17) 

where £\X) is a functional restructure error vector, W , W r and V , V r are 

nominal constant weight matrices, cr(-) and ^(-) are activation functions for hidden 

neurons. 

For the hysteresis model (16), the piecewise continuous function F2 will be approximated by 

the augmented NN. In this paper, it is assumed that there exists weight matrix W such that 

|^(x)| < s N with constant s N > , f or allx e R n , and the Frobenius norm of each matrix is 
bounded by a known constant \\W\\ < W N with W N > . 

3. NN-based compensator and controller design 

Given the augmented MLP NN and hysteresis model, a NN-based pre-inversion 
compensator for the hysteresis is designed to cancel out the effect of hysteresis. In this 
section, a novel approach is developed to compensate the hysteretic nonlinearity and to 
guarantee the stability of integrated piezoelectric actuator control system. 

3.1 Problem statement 

Consider a piezoelectric actuator subject to a hysteresis nonlinearities described by Duhem 
model. It can be identified as a second-order linear model preceded by hysteretic 
nonlinearity as follows: 

m.y(t)+b.y(t) + k.y(t) = k-c.T (t) 

(18) 
T pr (t) = H[v(t)] 

where v(t) is the input to piezoelectric actuator, y(t) denotes the position of piezoelectric 
actuator, m , b , k denote the mass, damping and stiffness coefficients, respectively, 
//(•) represents the Duhem model (1). 

In order to eliminate the effect of hysteresis on the piezoelectric actuator system, a NN- 
based hysteresis compensator is designed to make the output from hysteresis model r 

approach the designed control signal r d . After the hysteresis is compensated by the NN, an 
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adaptive control for piezoelectric actuator is to be designed to ensure the stability of the 
overall system and the boundedness of output tracking error of the piezoelectric actuator 
with unknown hysteresis. 
We consider the tracking problem, in which y(t) is to asymptotically track a reference 

signal y d (t) having the properties that y d (t) and its derivatives up to second derivative are 

bounded, and y d (t) is piecewise continuous, for all t > . The tracking error of the 
piezoelectric actuator is defined as 

e p (t) = y d (t)-y(t)- (19) 

A filtered error is defined as 

r p (t) = e p (t) + A p -e p (t) (20) 

where A p > is a designed parameter. 

Differentiating r At) and combining it with the system dynamics Eq. (18), one may obtain: 

m . b m ..... . 

I^' rp= ~irc' rp ~ Tpr *ir c ' (yd+ p ' 6p) 

a t rv C a C /01 \ 

b ,. ,. k. .1 

+7 — (ydHAp--)-e P )+—yd' 

K ' C u C 

The tracking error dynamics can be written as 



m . b T 

r r„ - r „„ + Y, 

k-c k-c 



r p =—j r p~ T pr+ Y d ' e P (22) 



where y 

1 d 



and 



k. 



yd +( a p --r)-e P yd +A p- e P yd 



6 



is a regression vector 



m b \ 

k-c k-c c 



G R 3 is a unknown parameter vector with 



pmin < 6 pi < #p max i = 1, 2, 3 where ' in and pmax are some known real numbers. 



3.2 NN-based Compensator for Hysteresis 

In presence of the unknown hysteresis nonlinearity, the desired control signal T d for the 

piezoelectric actuator is different from the real control signal r . Define the error as 



1 p l pd L pr 



(23) 
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Differentiating (23), yields 



thus, we have 



*p=ipd-ipr ( 24 ) 



* P =ipd-ZaV + Fi ( 25 ) 

Here we utilize a second first-layer-fixed MLP to approximate the nonlinear function F 2 . 

F 2 = W 2 T -cj{V 2 t -h) + W f2l T -cp 2l \V f2X T -(h-c 2l )] 

+ W f22 T -<p 22 [V f22 T -(h-c 22 )] ( 26 ) 

+ W f23 T -<p 23 [V f23 T -(h-c 23 )] + e 2 (h) 



where h = \u d t v v 1 J , r is the initial value of the control signal, V 2 , V 



T 
fl\ > 

V y 2 2 > an d Vfii are input-layer weight matrices, W 2 , ^21 / ^7 22 / anc ^ ^7 23 are 

output-layer weight matrices, 0, v^ , and -v s are jump points on the output layer, and 

°"(')/ ^21 0) / ^2i(0 ' anc * fe(') are tne a ctivation functions, and ^(/O is the functional 
restructure error in which inversion error is included. Output-layer weight 

T T T T 

matrices W 2 , Wr 2l , Wr 22 and Wr 2 ^ are trained so that the output of NN approximates 

to the nonlinear function F 2 . 
Let 

^hv s )=[o(y 2 T -h) (frWfil'K) (frWfil -h-v s ) <e> 3 (v f23 'h+v s )f 

and w { T =[W 2 T W f2l T W ' f22 W f23 T ]- The nonlinear function F 2 is expressed as: 

F 2 =W 1 T ®(h,v s ) + s l (h) (27) 

It is assumed that the Frobenius norm of weight matrix Wl is bounded by a known constant 
l^ll < W lN with W lN > and |^i(^)| ^ s lN with constant s lN > , for all x e R n . 

The estimated nonlinear function F 2 is constructed by using the neural network with the 
weight matrix W x : 

F 2 = W x T ®(h,v s ). 

Hence the restructure error between the nonlinear functions F 2 and F 2 is derived as: 
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F 2 =F 2 -F 2 =W x T ®(h,v s ) + £ x (h) 

"where W X T =W X T -W X T ." 

Remark 1 When the input changes its sign derivative (Beuschel et al, 1998), the augmented 

MLP can approximate the piecewise continuous functions. In the process, the "jump 

functions" leads to vertical segments in the feed-forward pre-inversion compensation, 

where the "functional restructure error" can be confronted by the adaptive controller in 

Section III.C (Selmic & Lewis, 2000). 

A hysteresis pre-inversion compensator is designed: 

v = ^{k b -t p +i pd + W x T -@{h,v s )+r p } (28) 

where fi = ™ in is an estimated constant, which satisfies < ju < 1 with the known 

a 

boundary of a e [a min a max J , k b is a positive constant, a is the estimated values of a , 
and W x =[W 2 T W f2l T W f22 W f23 T ] is the estimated output-layer weight matrix 

w*. 



Define error matrix as: 

Inserting (26), (28) into (25), we obtain 



w x = w x - w x 



Tp=-k b 'M'Ka'Tp+( l -M'Ka)'tpdHl-M'Ka) 

•W X T ^hv s )-ju-K a -r p+ W x T .®Qi,v s )+s x (h) 



(29) 



We choose weight matrix update rule as 

w^reih^-^+k^-Wt (30) 

where T is a positive adaptation gain diagonal matrix, and k x is a positive constant. 
Design the update rule of parameter ju in pre-inversion compensator v as 

k = Proj(p, tf-vtv+tffefovj + r,] (31) 

where rj is positive constant, Proj(.) is a projection operator, which is defined as follows: 
fi = ¥roj(fi, -rj -r pr • fi-[f pd +®(h,v s )] = 
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if ju = \ and T]-t pr -[i pd +©(h,v s )]<0 





if I-^M </i<l 
-T]-T pr -jU-[t pd +Q(h,v s )] or fi = l and tj ■ t pr ■ [t pd + @(h, v s )] > 

and Tj-T pr -[t pd +@(h,v s )]<0 







if ju-- 



and T]-r pr -[r pd +&(h,v s )]>0 



(32) 



The adaptive NN-based pre-inversion compensator v is developed to drive the adaptive 
control signal r d to approach the output of hysteresis model r so that the hysteretic 

effect is counteracted. 



3.3 Controller Design Using Estimated Hysteresis Output 

It is noticed that the output of hysteresis is not normally measurable for the plant subject to 
unknown hysteresis. However, considering the whole system as a dynamic model preceded 
by Duhem model, we could design an observer to estimate the output of hysteresis based on 
the input and output of the plant. 

The velocity of the actuator y(t) is assumed measurable. Define the error between the 
outputs of actuator and observer as 



e x =y-y 



(33) 



The observed output of hysteresis is denoted as f and the error between the output of 
hysteresis z and the observed i is defined as e 2 — r — i . Then the observer is 
designed as: 



y=y+L& 



I*! 



(34) 



? P r= K aV-F 2 +L 2 e x -K f 



(35) 



The error dynamics of the observer is obtained based on the actuator model and hysteresis 
model. 



e x — —L x e x — —L x e x 
e2=K a v-F 2 -L 2 e x +K pr i pr 



(36) 



where the parameter error is defined as K a = K a — K a 
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By using the observed hysteresis output f , we may define the signal error between the 
adaptive control signal r d and the estimated hysteresis output as: 

T pe = T pd ~ T pr ( 37 ) 

The derivative of the signal error is: 

*pe = Tpd ~ K a V + ?2 ~ L 2 e \ + K pr T pr • ( 38 ) 

A hysteresis pre-inversion compensator is designed: 

v = fi ' {k b • t^ + ipd +F 2 +r p ) . (39) 

By substituting the neural network output F 2 = W x ®(h, v s ) and pre-inversion compensator 
output into the derivative of the signal error, one obtains: 

^ pe ={\-k a ju)r pd -k a ju-k^^ -la +K pr r pr (40) 

The weight matrix update rule is chosen as: 

^=r0(A,v,) •?,, + *,, I?^-^ (41) 

And the update rule of parameter ju in pre-inversion compensator v is designed with the 
same projection operator as (32): 

jii=ProJjLh V'T pe \T pd +W^kv s )+r p -\). (42) 

The update rule of parameter K a in the observer (35) is designed with the same projection 
operator as (32): 

K a =ProXK a , r'M'T pe if pd +W^Kv s )+r p ]+v-? pe ). (43) 

Hence we design the adaptive controller and update rule of control parameter as: 

T pd =k pd -r p +Y d T '.0 p (44) 

O p =ProJ 6 p9 P-Y d .r p ) (45) 



270 Adaptive Control 

where the projection operator is 

{Proj § p , fi-Y d -r p )} t = 

if e pi =6 pmxL and p-{Y d -r p ),<o 

U t/pmin ^ "pi ^ "/?max 

P\Y d .r p \or 3 pi =0 pmax and fi-(Y d .r p ) t > 

or pi ^6 pmm and P\Y d -r p \<^ 

V pi =6 pmm and p.iYj.r^X) 

With the adaptive robust controller, pre-inversion hysteresis compensator and hysteresis 
observer, the overall control system of integrated piezoelectric actuator is shown in Fig. 3. 
The stability and convergence of the above integrated control system are summarized in 
Theorem 1. 

Theorem 1 For a piezoelectric actuator system (18) with unknown hysteresis (1) and a 
desired trajectory y d (t) , the adaptive robust controller (44), NN based compensator (39) and 
hysteresis observer (34) and (35) are designed to make the output of actuator to track the 
desired trajectory y d (t) . The parameters of the adaptive robust controller and the NN based 
compensator are tuned by the updating rules (41)-(43) and (45). Then, the tracking error 
e {f) between the output of actuator and the desired trajectory y^{t) converge to a small 

neighborhood around zero by appropriately choosing suitable control gainst d , k b and 
observer gains L x , L 2 and K . 
Proof: Define a Lyapunov function 

1 7W 2 1 ■ — 2 1 stT'tTt^ -It^n 1 ,+ ~ T ^ n2 

V 2=T-. r p 2 +-T pe 2 +--tr(W x T T 'w l ) + -—-(l-pK a ) 1 

2 k-c 2 2 2't}K a 



+ ^(K a -K a ) 2 + ^-(0 p -§ p f-(0 p -§ p ) + ^e 2 + ^e 2 
The derivative of Lyapunov function is obtained: 

k-c y y y y rj y 

Introducing control strategies (39), (44) and the update rules (41)-(43), (45) into above 
equation, one obtains 

V 2 =^+k pd )-r p 2 -k y ju-K a -T pe 2 +£l ^^^ 

K'C 

-e 2 r p -I^e^+KprTprTpe-I^e 2 -(L& +F 2 )e 2 +K pr r pr e 2 

l~l 1 2 1 2 

By using r = r — e 2 , \F\ < 8 lN and inequality: ±ab <—a H — b , one has: 
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V 2 =-(— + k pd )-r 2 p -k b -M-K a -T p ^s x {K)T pe - k pX \r pe \tr(W x T W x ) 

1 2 1 2 1^22 1~2 1~2 l r ^2 2 1 r ^2 ~2 1 2 

+ 2 2 + 2 P + 2 2i + 2 pe +1 2 pe + 2 pr pr + 2 pr pe + 2 2 
-L x e\ +-(L 2 e x +s lN ) 2 +-e 2 2 +K pr z pr e 2 -K pr e\ 

By using the inequality — ( a + b) 2 <a 2 +b 2 , we can derive the following inequality: 



V 2 < -(— + k pd - -) • r 2 -{k b -ji-k a - -K 2 pr - 1) • r p 2 + \t ps \- s lN 

~ k pl ■ \ 7 pe | • 1^1 \WlN " 1^1 1) " (A " \ Ll Yl ~ (Kpr ~ M + £w + K 2 pr T 2 pi 



pr 



From the Property 1 of Chapter 2 in the recent book (Ikhouane & Rodellar, 2007), we know 

2 2 2 

r pr is bounded (say, r pr < M where M is a constant), and then define a constant 

8 = s 2 N + K 2 pr M 2 > s 2 N + K pr T pr such that 

V2^-(^ + k pd -h-r p 2 -(k b -M-K-^K-- i y?pe 2+ \? P e[tlN (47) 



- k pi -Kim^N -pi-cA --a 2 m 2 -c^ -2> 2 2 +^ 



2 
2 



We select the control parameters k pd , k b and observer parameters ^ , L 2 and A' 
satisfying the following inequalities: 

b i l n 

k-c 2 

K pr >2 

1 7 
^•«max'^a--^r-l>0 

Z, >-L ? 2 
1 2 

1 2 

Let £ m = ^ • a max • ^ - -^ - 1 . If we have 

i~ I ~k p \-W lN /4 + s lN 
\T n A> — ' 



' pe 



\w\>W w l2+jw? N l4-e w lk pX (48) 

we can easily conclude that the closed-loop system is semi-globally bounded (Su & 
Stepanenko, 1998). 
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Hence, the following inequality holds 



" k pl -W lN 2 /A + 8 lN 

< b , 



where b r > represents the radius of a ball inside the compact set C r of the tracking error 

Thus, any trajectory t (t) starting in compact set C r = \r \\\r \\< b r } converges within C r 

and is bounded. Then the filtered error of system r (t) and the tracking error of the 

hysteresis T (t) converge to a small neighborhood around zero. According to the standard 

Lyapunov theorem extension (Kuczmann & Ivanyi, 2002), this demonstrates the UUB 

(uniformly ultimately bounded) of r p (t) , T pe (t) , W x , e x and e 2 . 

Remark 2 It is worth noting that our method is different from (Zhao & Tan, 2006; Lin et al 
2006) in terms of applying neural network to approximate hysteresis. The paper (Zhao & 
Tan, 2006) transformed multi-valued mapping of hysteresis into one-to-one mapping, 
whereas we sought the explicit solution to the Duhem model so that augmented MLP neural 
networks can be used to approximate the complicated piecewise continuous unknown 
nonlinear functions. Viewed from a wavelet radial structure perspective, the WNN in the 
paper (Lin et al 2006) can be considered as radial basis function network. In our scheme, the 
unknown part of the solution was approximated by an augmented MLP neural network. 

4. Simulation studies 

In this section, the effectiveness of the NN-based adaptive controller is demonstrated on a 
piezoelectric actuator described by (18) with unknown hysteresis. The coefficients of the 
dynamic system and hysteresis model are m =0.016kg, b =1.2Ns/um, k =4500N/ um, c =0.9 

urn /V, a =6, b =0.5, v, =6 um /s, p = 0.1, k pd = 50 . 

The input reference signal is chosen as the desired trajectory: y d — 3-sin(0.2;r£) . The 

control objective is to make the output signal y follow the given desired trajectory y d . From 

Fig. 1, one may notice that relatively large tracking error is observed in the output response 
due to the uncompensated hysteresis. 

The Neural Network has 10 hidden neurons for the first part of neural network and 5 
hidden neurons for the rest parts of neural network with three jumping points (0, v s ,—v s ). 
The gains for updating output weight matrix are all set as T = diag{lO} 25X25 . The activation 
function cr(-) is a sigmoid basis function and activation function (p(-) has the 



definition <p(-) ■ 






l + e*" j 



x > , otherwise zero. The parameters for the observer are 



set as: K a = 20 , k b = 100 , rj = 0.1, y = 0.1 , K = 10 , L x = 100 , L 2 = 1 and initial 
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conditions are j)(0) = , f(0) = . The system responses are shown in Fig.2, from which it 

is observed that the tracking performance is much better than that of adaptive controlled 

piezoelectric actuator without hysteretic compensator. 

The input and output maps of NN-based pre-inversion hysteresis compensator and 

hysteresis are given in Fig. 3, respectively. The desired control signal and real control signal 

map (Fig. 3c) shows that the curve is approximate to a line which means the relationship 

between two signals is approximately linear with some deviations. 

In order to show the effectiveness of the designed observer, we compare the observed 

hysteresis output f and the real hysteresis output T r in Fig. 4. The simulation results 

show that the observed hysteresis output signal can track the real hysteresis output. 
Furthermore, the output of adaptive hysteresis pre-inversion compensator v(t) is shown in 
Fig.5. The signal is shown relatively small and bounded. 




Fig. 1 Performance of NN controller without hysteretic compensator (a) The actual control 
signal (dashed line) with reference (solid) signal; (b) Error y x - y d 




(a) 



?ltaA*/%/^NMv/< 



(b) 

Fig. 2. Performance of NN controller with hysteresis, its compensator and observer (a) The 

actual control signal (dashed line) with reference (solid) signal; (b) Error y x - y d 
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(a) 



Com pen suitor 




(b) 



ssirecl and Estimated CZ 




(c) 

Fig. 3. (a) Hysteresis's input and output map r vs. v ; (b) Pre-inversion compensator's 

input and output map v vs. r d ; (c) Desired control signal and Observed control signal 
curve r pr vs. r pd . 




Fig. 4. Observed Hysteresis Ouput £ and Real Hysteresis Output z 



pr 



:y>\j^\/X/\j / ~ x . 



Fig. 5. Adaptive Hysteresis Pre-inversion Compensator v(t) 
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5. Conclusion 

In this paper, an observer-based controller for piezoelectric actuator with unknown 
hysteresis is proposed. An augmented feed-forward MLP is used to approximate a 
complicated piecewise continuous unknown nonlinear function in the explicit solution to 
the differential equation of Duhem model. The adaptive compensation algorithm and the 
weight matrix update rules for NN are derived to cancel out the effect of hysteresis. An 
observer is designed to estimate the value of hysteresis output based on the input and 
output of the plant. With the designed pre-inversion compensator and observer, the stability 
of the integrated adaptive system and the boundedness of tracking error are proved. Future 
work includes the compensator design for the rate-dependent hysteresis. 
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1. Introduction 

For low cost, easy assembly and less maintenance, overhead crane systems have been 
widely used for material transportation in many industrial applications. Due to the 
requirements of high positioning accuracy, small swing angle, short transportation time, 
and high safety, both motion and stabilization control for an overhead crane system 
becomes an interesting issue in the field of control technology development. Since the 
overhead crane system is underactuated with respect to the sway motion, it is very difficult 
to operate an overhead traveling crane automatically in a desired manner. In general, 
human drivers, often assisted by automatic anti-sway system, are always involved in the 
operation of overhead crane systems, and the resulting performance, in terms of swiftness 
and safety, heavily depends on their experience and capability. For this reason, a growing 
interest is arising about the design of automatic control systems for overhead cranes. 
However, severely nonlinear dynamic properties as well as lack of actual control input for 
the sway motion might bring about undesired significant sway oscillations, especially at 
take-off and arrival phases. In addition, these undesirable phenomena would also make the 
conventional control strategies fail to achieve the goal. Hence, the overhead crane systems 
belong to the category of incomplete control system, which only allow a limited number of 
inputs to control more outputs. In such a case, the uncontrollable oscillations might cause 
severe stability and safety problems, and would strongly constrain the operation efficiency 
as well as the application domain. Furthermore, an overhead crane system may experience 
a range of parameter variations under different loading condition. Therefore, a robust and 
delicate controller, which is able to diminish these unfavorable sway and uncertainties, 
needs to be developed not only to enhance both efficiency and safety, but to make the 
system more applicable to other engineering scopes. 

The overhead crane system is non-minimum phase (or has unstable zeros in linear case) if a 
nonlinear state feedback can hold the system output identically zero while the internal 
dynamics become unstable. Output tracking control of non-minimum phase systems is a 
highly challenging problem encountered in many practical engineering applications such as 
aircraft control [1], marine vehicle control [2], flexible link manipulator control [3], inverted 
pendulum system control [4]. The non-minimum phase property has long been recognized 
to be a major obstacle in many control problems. It is well known that unstable zeros cannot 
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be moved with state feedback while the poles can be arbitrarily placed (if completely 
controllable). In most standard adaptive control as well as in nonlinear adaptive control, all 
algorithms require that the plant to be minimum phase. This chapter presents a new 
procedure for designing output tracking controller for non-minimum phase systems (The 
overhead crane systems). 

Several researchers have dealt with the modeling and control problems of overhead crane 
system. In [5], a simple proportional derivative (PD) controller is designed to asymptotically 
regulate the overhead crane system to the desired position with natural damping of sway 
oscillation. In [6], the authors propose an output feedback proportional derivative controller 
that stabilizes a nonlinear crane system. In [7], the authors proposed an indirect adaptive 
scheme, based on dynamic feedback linearization techniques, which was applied to 
overhead crane systems with two control input. In [8], Li et al attacked the under-actuated 
problem by blending four local controllers into one overall control strategy; moreover, 
experimental results delineating the performance of the controller were also provided. In [9], 
a nonlinear controller is proposed for the trolley crane systems using Lyapunov functions 
and a modified version of sliding-surface control is then utilized to achieve the objective of 
cart position control. However, the sway angle dynamics has not been considered for 
stability analysis. In [10], the authors proposed a saturation control law based on a 
guaranteed cost control method for a linearized version of 2-DOF crane system dynamics. 
In [11], the authors designed a nonlinear controller for regulating the swinging energy of 
the payload. In [12], a fuzzy logic control system with sliding mode Control concept is 
developed for an overhead crane system. Y. Fang et al. [13] develop a nonlinear coupling 
control law to stabilize a 3-DOF overhead crane system by using LaSalle invariance theorem. 
However, the system parameters must be known in advance. Ishide et al. [14] train a fuzzy 
neural network control architecture for an overhead traveling crane by using 
back-propagation method. However, the trolley speed is still large even when the 
destination is arrived, which would result in significant residual sway motion, low safety, 
and poor positioning accuracy. In the paper [15], a nonlinear tracking controller for the load 
position and velocity is designed with two loops: an outer loop for position tracking, and an 
inner loop for stabilizing the internally oscillatory dynamics using a singular perturbation 
design. But the result is available only when the sway angle dynamics is much faster than 
the cart motion dynamics. In the paper [16], a simple control scheme, based on second-order 
sliding modes, guarantees a fast precise load transfer and swing suppression during the 
load movement, despite of model uncertainties. In the paper [17], it proposes a stabilizing 
nonlinear control law for a crane system having constrained trolley stroke and pendulum 
length using the Lyapunov 7 s second method and performs some numerical experiments to 
examine the validity of the control law. In the paper [18], the variable structure control 
scheme is used to regulate the trolley position and the hoisting motion towards their 
desired values. However the input torques exhibit a lot of chattering. This chattering is not 
desirable as it might shorten the lifetime of the motors used to drive the crane. In the paper 
[19], a new fuzzy controller for anti-swing and position control of an overhead traveling 
crane is proposed based on the Single Input Rule Modules (SIRMs). Computer simulation 
results show that, by using the fuzzy controller, the crane can be smoothly driven to the 
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destination in a short time with low swing angle and almost no overshoot. D. Liu et al. [20] 
present a practical solution to analyze and control the overhead crane. A sliding mode fuzzy 
control algorithm is designed for both X-direction and Y-direction transports of the 
overhead crane. Incorporating the robustness characteristics of SMC and FLC, the proposed 
control law can guarantee a swing-free transportation. J. A. Mendez et al. [21] deal with the 
design and implementation of a self-tuning controller for an overhead crane. The proposed 
neurocontroller is a self-tuning system consisting of a conventional controller combined 
with a NN to calculate the coefficients of the controller on-line. The aim of the proposed 
scheme is to reduce the training-time of the controller in order to make the real-time 
application of this algorithm possible. Ho-Hoon Lee et al. [22] proposes a new approach for 
the anti-swing control of overhead cranes, where a model-based control scheme is designed 
based on a V-shaped Lyapunov function. The proposed control is free from the 
conventional constraints of small load mass, small load swing, slow hoisting speed, and 
small hoisting distance, but only guarantees asymptotic stability with all internal signals 
bounded. This paper also proposes a practical trajectory generation method for a near 
minimum-time control, which is independent of hoisting speed and distance. In this paper 
[23], robustness of the proposed intelligent gantry crane system is evaluated. The evaluation 
result showed that the intelligent gantry crane system not only has produced good 
performances compared with the automatic crane system controlled by classical PID 
controllers but also is more robust to parameter variation than the automatic crane system 
controlled by classical PID controllers. In this paper [24], the I-PD and PD controllers 
designed by using the CRA method for the trolley position and load swing angle of 
overhead crane system have been proposed. The advantage of CRA method for designing 
the control system so that the system performances are satisfied not only in the transient 
responses but also in the steady-state responses, have also been confirmed by the simulation 
results. 

Although most of the control schemes mentioned above have claimed an adaptive 
stabilizing tracking/ regulation for the crane motion, the stability of the sway angle 
dynamics is hardly taken into account. Hence, in this chapter, a nonlinear control scheme 
which incorporates both the cart motion dynamics and sway angle dynamics is devised to 
ensure the overall closed-loop system stability. Stability proof of the overall system is 
guaranteed via Lyapunov analysis. To demonstrate the effectiveness of the proposed 
control schemes, the overhead crane system is set up and satisfactory experimental results 
are also given. 

2. Dynamic Model of Overhead Crane 

The aim of this section is to drive the dynamic model of the overhead crane system. The 
model is dived using Lagrangian method. The schematic plotted in Figure 1 represents a 
three degree of freedom overhead crane system. To facilitate the control development, the 
following assumptions with regard to the dynamic model used to describe the motion of 
overhead crane system will be made. The dynamic model for a three degree of freedom 
(3-DOF) overhead crane system (see Figure 1) is assumed to have the following postulates. 
Al: The payload and the gantry are connected by a mass-less, rigid link. 
A2: The angular position and velocity of the payload and the rectilinear position and 
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velocity of the gantry are measurable. 

A3: The payload mass is concentrated at a point and the value of this mass is exactly 
known; moreover, the gantry mass and the length of the connecting rod are exactly known. 
A4: The hinged joint that connects the payload link to the gantry is frictionless. 




Fig. 1. 3-D Overhead Crane System 

The 3-D crane system will be derived based on Lagrange-Euler approach. Consider the 
3-dimensional overhead crane system as shown in Figure 1. The cart can move horizontally 
in x-y plane, in which the moving distance of the cart along the X-rail is denoted as x(t) and 
the distance on the Y-rail measured from the initial point of the construction frame is 
denoted as y(t). The length of the lift line is denoted as I. Define the angle between the lift 
line and its projection on the y-z plane as d{t) and the angle between the projection line 
and the negative axis as j3(t) . Then the kinetic energy and potential energy of the system 
can be found in Equation (1.1) and (1.2), respectively and be expressed as the following 
equations. 



1 



1 



•2 



1 



•2 



K = —m l x + — (m x +m 2 )y + —rn c (x c + y c + z c ) 



(1) 



V = -mgl cos a cos /? 



(2) 



where X c , y c are the related positions of the load described in the Cartesian coordinate, 
which can be mathematically written as 



x = x + l sin a 



(3) 
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y c - y + l cos a sin ft (4) 

z c = -I cos a cos P (5) 

The following equations express the velocities by taking the time derivative of above 
equations 

x c = x + Id cos a (6) 

y c = j-/d sin a sin /? + //? cos a cos/? (7) 

z c = -la sin a cos j3 - IJ3 cos a sin ft (8) 

By using the Lagrange-Euler formulation, 



d_ 
dt 



r dL^ 



dq i 



-— = r„ i = 1,2,3,4. (9) 



where L = K — V , q i is the element of vector </ = [x _y « y^] and T,. is the 
corresponding external input to the system, we have the following mathematical 
representation which formulates the system motion 

M(q)q + C(q, q) + G(q) = T (10) 

where M{q)E.R is inertia matrix of the crane system, C(q,q) G R is the 

nonlinear terms coming from the coupling of linear and rotational motion, G(c[) E R 

is the terms due to gravity, and T = [u x U 0] is the input vector. 

As mentioned previously, the dynamic equation of motion described the overhead crane 

system also have the same properties as follows 

Pl:The inertia matrix M{q) is symmetric and positive definite for all q E R n . 

P2:There exists a matrix B(q,q) such that C(q,q) = B(q,q)q , and Vx E R 4 

X T (M - 2B)x = , i.e., M -2B is skew-symmetric. B(q, q x )q y = B(q, q y )q x . 

P3:The parameters of the system can be linearly extracted as 

M(q)q + C(q,q) + G(q) = W f (q,q,q)Q> f (11) 

where W A q^q/q) is the regressor matrix and O , is a vector containing the system 

parameters. 

Dynamic Model of Overhead Crane 

In this section, an adaptive control scheme will be developed for the position tracking of an 

overhead crane system. 
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2.1 Model formulation 

For design convenience, a general coordinate is defined as follows 

q 7 =W P ql\ 



where 



q P =[x y], q e =[a p\ 

and using the relations in P2, the dynamic equation of an overhead crane (10) is partitioned 
in the following form 



M m M 



M 



pp 

T 



pO 



M c 



y p 



+ 



B B 

pp 

B ep B e 



P o 



\ 4p ~ 

lie. 


+ 


~G p (q) 
G e {q) 


= 


0_ 



(12) 



where M , M , M Qe , B , B Q , B~, B QQ are 2x2 matrices partitioned from 
the inertia matrix M{q) and the matrix B(q,q) , respectively, G , G e are 2x1 
vectors, and U =\u x U ] . Before investigating the controller design, let the error 
signals be defined as 



e = q~q d =[e T p e T e f 



(13) 



and the stable hypersurface plane is defined as 
s = e + Ke = 



K +K P e P 



(14) 



where 



e p =q p -g pd =[x-x d y-y d ] T =[e x e ] 7 



e e =qe-qm=\. a - a d P~Pdf=\. e a e sY' 





\k x 


ol 




\K 


0] 








, K e = 






p 





IVy 


' U 





K \ 



and X d , y d , CC d and j3 d are defined trajectories of X , y , OL and f3 respectively, 

and K , K are some arbitrary positive definite matrices. 

Then, after a lot of mathematical arrangements, the dynamics of the newly defined signal 



vectors S n , S a can be derived as 

p v 
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M 

M 



pp 

T 

pe 



M 



pe 



+ 



B D n B 



pp ^pe 



Bnr> B 



00 . 



T p +U p 



(15) 



where 
t p =M p P Hpd +k p e p ) +M P e(-<i e +k e e )+B pp (-q pd +k p e p )+B p0 (-q +k p e g ) (16) 

*s = M eo(-<i P ci +KK)+M gg (-q g +k g e g )+BJ-q , +ke)+B 00 {-q +k g e g ) (17) 



Remark 1: The desired trajectories X d , y d , (X d and j3 d should be carefully chosen so as 
to satisfy the internal dynamics, as shown in the lower part of equation (15), when the 
control objective is achieved, i.e., 



M P0 (q d ) 



y d 



+M eg (q d ) 



a. 



A 



+ B(p(q,q) 



y d 



+ B 00 (q,q) 



a. 



A 



+ G (q) = O (18) 



Without loss of generality, we always choose an exponentially-convergent trajectories with 
final constant values for X d , y d and zero for Oi d , j3 d . 

2.2 Adaptive Controller Design 

In this subsection, an adaptive nonlinear control will be presented to solve the tracking 
control problem. 



q p ^ P ^e^e 



q P ,q P ,<ie><ie 



Fig. 2. An Adaptive Self-tuning Controller Block Diagram 



As indicated by property P3 in section 1.2, the dynamic equations of an overhead crane 
have the well-known linear-in-parameter property. Thus, we define 
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<>\<k =M pp {q pd + k p e p )+M p0 (-q m k e ) + B pp (q pd +k p e p ) + B p0 k p e p (19) 



^iA = M ep(<i P a +k p e p ) + M 00 {k e ) + B 0p {q pd +k p e p ) + B 00 k e 6 



(20) 



where CO x , 0) 2 are regressor matrices with appropriate dimensions, and (j> x , (f> 2 are their 
corresponding vectors of unknown constant parameters, respectively. As a majority of the 
adaptive controller, the following signal is defined 



z. =< 



2(4z x a x (t) + b x (t)), 
S x ,Z x (t) = 0, 



z,(0>o 
*,(0>o 
MO^o 



(21) 



where S x is some small positive constant and 



M0 = 



¥p\\ +£ 



-(-s/(oJ 2 -s/K v0 s ) 



K(t) = 



-(-s e T G) 2 (j) 2 -s e T K ve s e ) 



\\ s p\\ +£ 



(22) 



(23) 



Remark 2: Note that (21) is simply to define a differential equation of which its variable 
Z x (t) rem ains positive. Let another signal k(t) be defined to be its positive root, i.e., 
k = JZ Y , It can be shown that 



k(t) = — ( 
*(0 



i M\ s p\\ +£ 



)(-s T a>J 2 -s T K v0 s ) k*0 



(24) 



¥p\\ +£ 



In the sequel, we will first assume that there exists a measure zero set of time sequences 
\fi U=i sucn tnat Z(fi ) - or k{t i ) = , / = 1,2,3 v- 00 > and tnen ^ verify the existence 
assumption valid. 
Now let the adaptive control law be designed as 



U p =-a) l <f> l -T v -Ks 



vp p 



(25) 
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T„ = 



(k-l)s t 



i-s/coJ 2 -s/K v0 s ) 



\\ s p\\ +£ 



(26) 



where 



<°A = M pp (<i P d + k p e p ) + M pe (V* ) + B p P (4 pd + k p e p ) + B p0 k p e p (27) 



(»2<f>2 = M 0n(<i P d + k e ) + M 00 {k e ) + B,{q d + k e ) + B 00 k e 



(28) 



and (j> x , (j) 2 are the estimates of (j> x , (f> 2 respectively, then the error dynamics can be 
obtained as 



pp 


m p ; 


V 
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B pp 


B P0 


V 


+ 


X, 


o " 


V 




o>A-? v 


P0 


M ee \ 


L^J 




B 8P 


B ee_ 


V s e\ 







**J 


UJ 




_co 2 2 +K v0 s _ 



(29) 



or more compactly as 



M{q)s + h(q, q)s + Ks = 



^2 +K vO S 6 



(30) 



where 



f 2 - <f> 2 



(31) 



Moreover, let the adaptation laws be chosen as 



A = - k a®X S p 

k = -k b co 2 s e 



(32) 



where k a , k b are some positive definite gain matrices. In what follows we will show that 
the error dynamics (30) along with the adaptive laws (32) constitutes an asymptotically 
stable closed-loop dynamic system. This is exactly stated in the following theorem. 
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Theorem : Consider the 3-D overhead crane system as mathematically described in (10) or (12) with 
all the system parameters unknown. Then, by applying control laws (25)-(28) and adaptive laws (32), 
the objective for the tracking control problem can be achieved, i.e., all signals inside the closed-loop 
system (29) are bounded and e x , e a , e , e n — > asymptotically in the sense ofLyapunov. 

Proof: Define the Lyapunov function candidate as 

V{t) = ±s T M(q)s + ^ x T k^ x +^ 2 T K l l +±Z x 



2 



1 7r , -I 7 1 ~Tj-l~ * 



= —s T M(q)s + — l T k a l l + —(/) 2 T k b x <p 2 + —k 2 



It is obvious that, due to the quadratic form of system states as well as the definition of 
Z x (t) , V(t) is always positive-definite and indeed a Lyapunov function candidate. By 
taking the time derivative of V we get 



V(t) = s T M(q)s + — s T M(q)s + l T k a l <j> x + <j> 2 k b p 2 +kk 



= s T (-B(q,q)s-Ks + 



-(D x <f> x -r v 



) + -s T M{q)s + s p T co x <f> x +s T co 2 <t> 2 



k\s p \ +£ T A T 
+ (-^Ti X~^ ®2<I>2-Se K ve s e) 



\\ s p\\ +£ 



= -sKs - s p T co x <t> x - ( j: J - )(-s/coJ 2 - s e T K v0 s e ) + s e T co 2 <f> 2 + s p T co x <f> x 



f P \\ +£ 



h-\\ C -I- f 

+ (~TT 2 ^~ s e T( °ik -^/K v0 s ) + s/coJ 2 +s t K v0 s 6 

\\ s p\\ +£ 



= -s'Ks 



(33) 



On the Adaptive Tracking Control of 3-D Overhead Crane Systems 



287 



It is clear that V(t) < as long as K > , which then implies S, k, <fi l9 <fi 2 G L^ Now, 

assume that k(t) = instantaneously at t t . Because the solution Z x (f) of the equation 

(21) is well defined and is continuous for all t > , k(t) is continuous at t t , i.e., 

k(t t — ) = k(t l : +) . Since V is a continuous function of k , it is clear that V{t) remains 

to be continuous at t t , i.e. , V(t t — ) = V(t t +) . Form then hypothesis, V(t t ) < and 

K(7- ) < 0, we hence can conclude that V is nonincreasing in t including t i , which then 

readily implies that S,k E L^ . Therefore, ^,7" v and T Q gI^ directly from equation 

(13) and definitions of T v and T ' e . It then followsf irpm (30) that S E L^ . On the other 

hand, if the set of time instants *■ l ' i=l is measure zero, then 

r r ii ii2 

- I ^Vdt = V(0) - V(co) < oo or equivalently that - I * p| dt < CO so that 5 E L 2 . 

Form the error dynamics, we can further conclusion that S E L^ . Then by Barbalat's 

lemma we readily obtain that S — > as t — > oc asymptotically as t — > oc and therefore, 

e 5 g->0 as ^ — > oc Note that in the above proof we have used the property 

{M{q) — 2B{q^q)) is skew- symmetric. Finally, to complete the proof in theory, we 

need to show that the above hypothesis that the set of time instants \t t }. =1 is indeed 

measure zero. However, it is quite straightforward to conclude the result from (21) by 

simply using the fact that all signals are bounded. This completes our proof. 



Remark 3: From the robustness point of view, it would be better if additional feedback term 
— ks e is included in the control law (24). With such an inclusion, the sway stabilization 
result subject to external disturbance can also be maintained as the cart arrived at its 
destination. This can be easily checked from the stability proof given in the theorem. 
Proof: Let the Lyapunov function candidate be chosen as 

V(t) = j S T M(q) S + y£ 7 V 1 £ + ^$ 2 T k b - l $ 2 + X -Z x 
and take the time derivative of V to get 



T, -1 ' 



' T , -1 ' 



V(t) = s T M(q)s + -s T M(q)s + ft k a ~ l (f> x + ft k~ l (f> 2 + kk 



s T {-B{q,q)s-K vp s + 



®2<l>2+ K ve S 



"V« 



ve^e 



) + -s T M(q)s + s p T 6) l l +s/(D 2 2 
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+(- 



+ £ 



)(-s/coJ 2 -s T K v0 s g ) 



+ £ 



= -s Ks-k q s g s p 



V(t) = -s T K v s-k q s e s p 



2 2> 



V(t)<-A miD (K v )(s p 2 +s ff 2 ) + ±k q (s; +s/) 



= -(^ a (K v )-h q ){s p 2 +s 2 ) 



Thus, the same conclusion can be made as preciously if 

^ m (K v )>\k 



3. Computer Simulation 

In this subsection, several simulations are performed and the results also confirm the 
validity of our proposed controller. The desired positions for X and Y axes are 1 m. Figure 3 
shows the time response of X-direction. Figure 5 show the time responses of Y-direction. It 
can be seen that the cart can simultaneously achieve the desired positions in both X and Y 
axes in approximately 6 seconds with the sway angles almost converging to zero at the same 
time. Figure 4 and Figure 6 show the response of the sway angle with the control scheme. 
Figure 7 and Figure 8 show the velocity response of both X-direction and Y-direction. Figure 
9 and Figure 10 show the control input magnitude. In Figure 11-14, the parameter estimates 
are seen to converge to some constants when error tends to zero asymptotically and the time 
response of the tuning function k(t) is plotted in Figure 15. 
The control gains are chosen to be 
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K = 
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>Ke = 





1.2 



The corresponding adaptive gains are set to be k a = k b = 1 
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Fig. 3. Gantry Tracking Response x(t) with Adaptive Algorithm 
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Fig. 4. Sway Angle Response Gc(t) with Adaptive Algorithm 
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Fig. 5. Gantry Tracking Response J>(£) with Adaptive Algorithm 
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Fig. 6. Sway Angle Response j3(t) with Adaptive Algorithm 
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Fig. 7. Gantry Velocity Response x(t) with Adaptive Algorithm 
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Fig. 8. Gantry Velocity Response y(t) with Adaptive Algorithm 
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Fig. 9. Force Input U x 
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Fig. 10. Force Input U 
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Fig. 11. Estimated Parameters <fi lx (t) 
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Fig. 12. Estimated Parameters 
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Fig. 13. Estimated Parameters (f) x {£) 



10 15 20 25 

Time(sec) 



A l 








ft i 








1 \ '/\ 








1 ^ 








| 








i 








; 








JfJ 












/ Kj) J ! ' 
















v u^ \ 








V 









Fig. 14. Estimated Parameters (f) 2a {£) 
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Fig. 15. Response Trajectory of k(t) 

4. Experimental Verification 

In this section, to validate the practical application of the proposed algorithms, a three 
degree-of-freedom overhead crane apparatus, is built up as shown in Figure 16. Several 
experiments are also performed and indicated in the subsequent section for demonstration 
of the effectiveness of the proposed controller. 




Fig. 16. Experimental setup for the overhead crane system 



The control algorithm is implemented on a xPC Target for use with real time Workshop® 
manufactured by The Math Works, Inc., and the xPC target is inserted in a Pentium4 
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2.4GHz PC running under the Windows operating system. The sensing system includes the 
two photo encoders and two linear position sensors. The cart motion X-direction and 
Y-direction motion measured by linear potentiometer. Two potentiometers are connected to 
the travel direction and the traverse direction. An AC servo motor with 0.95 N-m maximum 
torque and 3.8N-m maximum torque output is used to drive the cart motion X direction and 
Y direction. The servomotors are set in torque control mode so as to output the desired 
torques. 

In the experimental study, the proposed control algorithms have been tested and compared 
with the conventional PD controller. From the experimental results, it is found that our 
proposed algorithms indeed outperform the conventional control scheme in all aspects. A 
schematic description of the experimental system is draw in Figure 17. 



. linear position transduc ers 




Trolley position 



Proposed measurement and 
Connol sjjEtem 



Mrtnr control unit 



Fig. 17. A Schematic Overview of the Experimental Setup 



4.1 Experiments for Conventional PD control as a comparative study 

In the experiments, a simple PD control scheme with only position and velocity feedback is 
first tested for the crane control. Figure 18 and Figure 20 show the control responses. From 
Figure 19 and Figure 21 it is observed that the sway oscillation can not be rapidly damped 
by using only conventional PD control, although the tracking objective is ultimately 
achieved. 
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Fig. 18. Gantry Tracking Response x(t) with Conventional PD Control 
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Fig. 19. Sway Angle Response Oi{t) with Conventional PD Control 
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Fig. 21. Sway Angle Response j3(t) with Conventional PD Control 
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Fig. 20. Gantry Tracking Response y(t) with Conventional PD Control 

4.2 Experiments for the Proposed Adaptive Control Method with Set-point Regulation 

In the subsection, the developed adaptive controller is applied. The following controlled 
gains are chosen for experiments. 
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The corresponding adaptive gains are set to be 1 i.e., k a — k b = 1 . Figure 22-31 depict the 
experimental results for the crane system with the adaptive control law. Figure 22 and 
Figure 24 demonstrate the tracking performance in X and Y directions. It is experimentally 
demonstrated that the sway angle can be actively damped out by using our proposed 
adaptive schemes, as shown in Figure 23 and Figure 25 with maximum swing angle about 
0.05 rad and 0.06 rad, respectively. Figure 26 and Figure 27 show the input torques from 
each AC servo motors, whereas Figure 28-30 plot the associated adaptive gain turning 
trajectories. The trajectory of coupling gain k(t) is also in Figure 31 with initial value 0.05. 
The initial values of other state variable are all zero. Apparently the tracking and damping 
performances by applying the adaptive control algorithm are much better than the ones 
resulting from the PD control. 
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Fig. 22. Gantry Tracking Response with Adaptive Algorithm X(t) 




Fig. 23. Sway Angle Response with Adaptive Algorithm CC (t) 
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Fig. 24. Gantry Tracking Response with Adaptive Algorithm Y(t) 



298 



Adaptive Control 




10 12 14 



Fig. 25. Sway Angle Response with Adaptive Algorithm /? (t) 




Fig. 26. Force Input Ux 




Fig. 27. Force Input Uy 
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Fig. 28. Estimated Parameters (j) lx (t) 
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Fig. 29. Estimated Parameters (j> x (t) 



300 



Adaptive Control 



u.y 










1 


1 


1 


1 


- 


0.8 










0.7 


















- 


0.6 


















- 


0.5 


















- 


0.4 


















- 


0.3 


















- 


0.2 


















- 


0.1 


















- 



[ 




















) 


2 


4 


6 


8 


10 


12 


14 


16 












Tiine(sec) 











Fig. 31. Trajectory of k (t) 



4.3 Experiments for the Proposed Adaptive Control with Square Wave Tracking 

To prove the prevalence of our controllers, experiments on the tracking of square wave, as 
shown in Figure 6, is also conducted. The gains are kept the same as in the previous 
experiments. Figure 6(a) and Figure 6(c) demonstrate the tracking performance in X and Y 
directions, respectively while Figure 6(b) and Figure 6(d) show the suppression results of 
sway angles. It is found that good performance can still be preserved is spite of the sudden 
change of desired position. 
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Fig. 32. Desired Trajectory 
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Fig. 33. Tracking Response x(t) with Adaptive Algorithm 




Fig. 34. Sway Angle Response (X (t) with Adaptive Algorithm 




Fig. 35. Tracking Response y(t) with Adaptive Algorithm 
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Fig. 36. Sway Angle Response j3(t) with Adaptive Algorithm 
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Fig. 37. Trajectory of k(t) 

5. Conclusion 

In this chapter, a nonlinear adaptive control law has been presented for the motion control 
of overhead crane. By utilizing a Lyapunov-based stability analysis, we can achieve 
asymptotic tracking of the crane position and stabilization of payload sway angle for an 
overhead crane system which is subject to both under actuation and parametric 
uncertainties. Comparative simulation studies have been performed to validate the 
proposed control algorithm. To practically validate the proposed adaptive schemes, an 
overhead crane system is built up and experiments are also conducted. Both simulations 
and experiments show better performance in comparison with the conventional PD control. 
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APPENDIX A 

Mathematical Description of The Dynamic Model 

The dynamic equation of the 3D overhead crane system can be derived by using 
Largrange-Euler formula and shown in the following 

M(q)q + C(q,q) + G(q) = T 

where 



M(q) = 



m x +m c 








m x + m 2 + m c 


mjcosa 


-rnJsinasm/3 





mjcosacosp 



mjcosa 

rnJsinasin/3 mj cosacos/? 
mj 2 

ml 2 cos 2 a 
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C(q,q) = 



- mjd sin a 
mj(d 2 + p )cos a sin /? - 2m Jd/3 sin a cos /? 
mj 2 p sin a cos a 
- 2m J 2 d0 sin a cos a 



Gfo) = 






- m c gl since cos j3 
-rn c glcosasin{3 



t = [u x u y of 
q = [x y a J3\ 



To satisfy property P2 as stated in section 2 the vector C(q,q) can be re-arranged as 
C(q, q) = B(q, q)q where 



B(q 9 q) 



- mjd 2 sin a 

0- mjd cos or sin /? - m c //? sin a cos /? 



- mj 2 j3 sin a cos /? 



- m c /a sin a cos /? - m J ft cos a sin J3 

m c / 2 sin a cos J3 

- m c l 2 d sin a cos /? 



It can be easily checked that 
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M-2C = 







- mjd sin a -mjd cos a sin j3 - mjj3 sin a cos ji 


mjd sin a 

m c /dr cos a sin /? + m c //? sin # cos /? 

-2mJ 2 PsinacosP 

2mJ 2 PsinacosP 



which is skew-symmetrical matrix. 
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1. Introduction 



In recent years, control Lyapunov functions (CLFs) and CLF-based control designs have 
attracted much attention in nonlinear control theory. Particularly, CLF-based inverse 
optimal controllers are some of the most effective controllers for nonlinear systems [Sontag 
(1989); Freeman & Kokotovic (1996); Sepulchre et al. (1997); Li & Krstic (1997); Krstic & Li 
(1998)]. These controllers minimize a meaningful cost function and guarantee the optimality 
and a stability margin. Moreover, we can obtain the optimal controller without solving the 
Hamilton-Jacobi equation. An inverse optimal controller with input constraints has also 
been proposed [Nakamura et al. (2007)]. On the other hand, these controllers assume that 
the desired state of the controlled system is an equilibrium state. Then, if the controlled 
system does not satisfy the assumption, we have to use a pre-feedback control design 
method to the assumption is virtually satisfied. However, a pre-feedback control design 
causes the luck of robustness. This implies that a stability margin of inverse optimal 
controllers is lost. Hence the designed controller does not asymptotically stabilize the 
system if there exists a parameter uncertainty in the system. 

In this article, we study how to guarantee a stability margin when the pre-feedback 
controller design is used. We consider a magnetic levitation system as an actual control 
example and propose an adaptive inverse optimal controller which guarantees a gain 
margin for the system. The proposed controller consists of a conventional inverse optimal 
controller and a pre-feedback compensator with an adaptive control mechanism. By 
introducing adaptive control law based on adaptive control Lyapunov functions (A CLFs), 
we can successfully guarantee the gain margin for the closed loop system. Furthermore, we 
apply the proposed method to the actual magnetic levitation system and confirm its 
effectiveness by experiments. 

This article is organized as follows. Section 2 introduces some mathematical notation and 
definitions, and outlines the previous results of CLF-based inverse optimal control design. 
Section 3 describes the experimental setup of the magnetic levitation system and its 
mathematical model. In section 4, we design an inverse optimal controller with a pre- 
feedback compensator for the magnetic levitation system. The problem with the designed 
controller is demonstrated by the experiment in section 5. To deal with the problem, we 
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propose an adaptive inverse optimal controller in section 6. The effectiveness of the 
proposed controller is confirmed by the experiment in section 7. Section 8 is devoted to 
concluding remarks. 

2. Preliminaries 

In this section, we introduce some mathematical definitions and preliminary results of CLF- 
based inverse optimal control. We also refer to ACLF-based adaptive control techniques. 

2.1 Mathematical notations and definitions 

We use the notation R > ;= [0, oo) . 

Definition 1 A function sgn(y) is defined for y e R by the following equation: 



sgn(y) = 



-i (y<o) 

(y = 0) (1) 

[1 (y>0). 



In this section, we consider the following input affine nonlinear system: 

x=f(x) + g(x)u, (2) 

where x e R n is a state vector, u eU <^R m is an input vector and U is a convex subspace 
containing the origin u = . We assume that f : R n -^ R n and g : R n -> R nxm are continuous 
vector fields, and f(0) = 0- Let L f V an d L V be the Lie derivative of f(x) and g(x) 
respectively, which are defined by 

i,vw = ff M , (3) 

W) = f gW . (4) 

For simplicity of notations, we shall drop (x) in the remaining of this article. We suppose 
that a local control Lyapunov function is given for system (2). 

Definition 2 A smooth proper positive-definite function V : X — » R >0 defined on a 

neighborhood of the origin X cz R n is said to be a local control Lyapunov function (local 
CLF) for system (2) if the condition 

mi{L f V + L g Vu}<0 (5) 
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is satisfied for all x e X \ {0} . Moreover, V(x) is said to be a control Lyapunov function 
(CLF) for system (2) if V(x) is a function defined on entire R n and condition (3) is satisfied 
for all xeR n \{0}. 

If there exists no input constraint (U = R m ), a smooth proper positive-definite function 
V :R n ->R >0 is a CLF if and only if 



L g V = 0^L f V<0,Vx*0. 



(6) 



In this article, we guarantee the robustness of controllers by sector margins and gain 
margins. 

Definition 3 A locally Lipschitz continuous mapping (j){u) e R m is said to be a sector 
nonlinearity in (a, /?) with respect to ueR m if the following conditions are satisfied: 



au T u < u T (j)(u) < (3u T u,\/u * 0, 
#0) = 0. 



(7) 



Definition 4 System (2) is said to have a sector margin {a, p) with respect to uefi™ if the 
closed system 



x=f(x) + g(x)0(u) 



(8) 



is asymptotically stable, where ^(u) is any sector nonlinearity in (a,P) with respect to 
ueR m . 

Definition 5 System (2) is said to have a gain margin (a, p) with respect to u eR m if the 
closed system (8) is asymptotically stable, when (j)(u) is given as follows: 



^(u) = hii,Ke(a,/3). 



(9) 



By the definition, gain margins are the special case of sector margins. If system (2) has a 
sector margin (a, ft) , it also has a gain margin (a, ft) . 

2.2 Inverse optimal controller 

We introduce the inverse optimal controller proposed by Nakamura et al [Nakamura et al. 
(2007)]. The following results are obtained for system (2) with input constraint 



Vr:= 



uefi" 



«i= IN I <c « 



(10) 
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where 1 < k < oo is a constant and C(x) > is continuous on R n . 

Theorem 1 We consider system (2) with input constraint (10). Let V(x) be a local CLF for 
system (2) and a x > be the maximum number satisfying 



inf \L f V + L n V-u\<0, 

u&r f 9 } 

yxeW 1 :={x\V(x)<a 1 \ 



(11) 



Then, W 1 is a domain in which the origin is asymptotically stabilizable. If V(x) is a CLF, 
then a x = oo and W 1 = R n • 

Theorem 2 We consider system (2) with input constraint (10) . Let V(x) be a local CLF for 
system (2) , P(x) be a function defined by 



P(x) = 



L f V 



c (4 L «n 



(12) 



and a r e (0, aj be the maximum number such that the condition 

mfU f V+ k ~ 1 ' 



uzU£ 



k -L g V-u}<0, 



VxeW r :={x|V(x)<a r } 
is satisfied, and d be a positive constant. Then, input 

1 



R(x) 



L 3 y k - 1 



Sg] 



n(LV) (i=X-,m), 



(13) 



(14) 



R(x) = 



(2 + q (A/» i/v 



k-1 



(P + |p|) + q(x)|L e VV 



q(x) 



(V*°) 



(L„V = 0), 



(15) 



q(x) = dC' ( - 1 (x) 



1 



(16) 
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asymptotically stabilizes the origin in W , and minimizes the cost function: 

r fL x R k ~\x)\\ II* L 

° [ k J (17) 

fc-1 1 „ „— 

'00 = L.VV-LfV. 

k B(x)» 9 »£ f 

Moreover, it achieves at least a sector margin ((2, /?) in W r • 

2.3 Adaptive control problem 

We consider an adaptive control problem for nonlinear systems. In this section, we 
introduce some definitions and properties. We consider the following input affine nonlinear 
system: 

x=f o (x)+f l (x)0 + g(x)u, (18) 

where x e R n is a state vector, u e R m is an input vector, and 6 e R p is a constant unknown 
parameter vector. We assume that f :R n -^ R n , g : R n -» R nxm and f x : R n -^ R nxp are 
continuous vector fields, and f Q (0) = • Note that there exists no input constraint. 
The stabilizability of the system with unknown parameters is defined as the following. 

Definition 6 Let 6 be an estimate of 6 . We say that (18) is globally adaptively stabilizable 
if there exist a function a(x, 6) continuous on R n \{0}xR p with a(0,§) = 0, a continuous 
function t(x, 6) , and a positive definite symmetric px p matrix T , such that the dynamic 
controller 

u = a(x,§), , 19 , 

3 = Tt(x,3) (20) 

guarantees that the solution (x, 6) is globally bounded, and x -^ as t -^ oo for any value 

of the unknown parameter 6 e R p . 

For the stabilization problem, we introduce an adaptive control Lyapunov function (ACLF) 

as the following. 

Definition 7 We consider system (18) and assume that y (x,0) is a CLF for system (18). 
Then, y (x, #) is called an adaptive control Lyapunov function (ACLF) for system (18) if 
there exists a positive-definite symmetric matrix T such that for each 6 e R p , V is a CLF 
for the modified system 
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*= foW+fiW 



do 



+ g(x)u. (21) 



Krstic et al. (1995) proved the following theorem. 
Theorem 3 The following two statements are equivalent: 

(1) There exists a triple (a,V ,T) such that a(x,6) globally asymptotically stabilizes (21) at 

x = for each 6 e R p with respect to the Lyapunov function y (*, 0) . 

(2) There exists an ACLF V a (x, 6) for system (18). 

3. Magnetic Levitation System 

3.1 System configuration 

We consider a stabilization problem of a magnetic levitation system shown in Fig. 
l[Mizutani et al. (2004)]. The system consists of a magnet with a disk, a glass guide rod, 
upper and lower magnetic drive coils that generate a magnetic field in response to a DC 
current and two laser-based sensors that measure the magnetic position using the reflection 
of the disk surface. 

3.2 Mathematical model of the system 

In this article, we control the position of the magnet using attractive force generated by the 
upper drive magnetic coil. The force diagram is illustrated in Fig. 2. £ is the position of the 
magnet from the upper coil, and F is an attractive force for the magnet generated by the 

upper drive magnetic coil. 

The dynamical equation for the magnet is described by 

m'i = F u - mjui -mg , ( 22 ) 

where m is the mass of the magnet, ju is a friction constant. g Q is the gravitational 
acceleration. 




Fig. 1. Magnetic levitation system 
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Electromagnet 



Disk Magnet 



E, 



U 



< 



v 
Fig. 2. Force diagram of the magnetic levitation system 



Here, F is modeled by 



F = 



a(-t + b) 4 



(23) 



where a and b are constants determined by numerical modeling of the magnetic 
configuration, and u is a control input voltage for the upper coil. The position £ is 
measured by the upper laser sensor. 

Let £* be the desired position of the magnet, x 1 =£-£*, and x 2 = x 1 - We set x = [x 1 ,x 2 ] T . 
Then we obtain the following state equation: 



x= f(x) + g(x)u, 



where f(x) and g(x) are defined as 



fix)'- 



,000 = 



The system parameters are shown in Table 1. 



maf-^ - £* + b) 4 



Table 1. Parameter values of the magnetic levitation system 



(24) 



(25) 



m 
[kg] 


M 
[-] 


9o 
[m/s2] 


[V/N-m*] 


5 
[m] 


0.12 


4.5 


9.80665 


40118.9 


0.056464 
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There exists the following input constraint in system (24): 

|u| 2 =V^<5[V]. (26) 

By the above discussion, the control problem is reduced to the stabilization problem of 
system (24) with the input constraint (26). 

4. Pre-feedback Gravity Compensation 

In system (2), we assume that f(Q) = 0- However, f (0) ^ in system (24). Therefore, we 
cannot directly apply the inverse optimal controller (14) to system (24). To achieve f(0) = , 
we design a controller to compensate for gravity by a pre-feedback input. 
We consider the following gravity compensation input u (x) as 

u c (x) = mg a(-x 1 -f +b)\ (27) 

Substituting (27) into (24), the gravitational acceleration g Q is successfully canceled. Then, 
we split the input u(x) using u (x) as 

u(x) = u c (x) + u s (x), (28) 

where u s is an asymptotic stabilizing input for system (24) when g Q = . 
By using (26) and (28), the input constraint is rewritten to 

|u(x)| 2 =|u c (x) + u s (x)| 2 <5. (29) 

To handle input constraint (29) as a norm constraint, we rewrite (29) as 

|u,(x)|<||5-|u c (x)| \[:=C(x). (30) 

(30) represents a constraint depending on the state. Note that constraint (30) is more severe 
than the original constraint (29). The problem of designing controller (28) is reduced to the 
problem of designing controller u s (x) with input constraint L (x)| < C(x) • 

To apply inverse optimal controller (14), we construct a CLF for system (24). In general, the 
controller performance often depends on a CLF. However, it is unclear which CLF achieves 
the best control performance. Hence, we construct a CLF with a design parameter. Using the 
integrator backstepping method, a CLF V(x) can be carried out as 

V(x) = ^(r 2 +1K 2 + rx,x 2 + ^x 2 2 , (31) 
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where r is a positive constant and also a design parameter. 
Now, we construct input u s ■ Let f (*) be the function defined by 



jUX 2 



f (x)=f(x)\ go=0 = 
By using (31), we can calculate l V an d LV as 

to 9 

L f V = {(r 2 +l)-jUr}x l x 2 +(r-jU)x 2 



LV = ™i+*2 

ma(-x l -E, +b) 



(32) 

(33) 
(34) 



Substituting (33) and (34) into (14) and (15), we get the following input u (x) • 



R l (x) = 







u,(x) = 


Kl(x) 9 


(2 + 


q(4 L 9 V \ 


2 )|M 


2 cr v^m 


m 


+ |P 1 |) + q(x)|l 9 V 


1% V ^ v) 

2 




2 
q(x) 




(L 9 V = 0), 






P 1 (x) = 


L J 




c wlML' 



q(x) = dC(x) = dJ5-\u c (x)\ | 

According to Theorem 2, u (x) has a sector margin (1/ 2, oo) . 
Finally, the following controller u(x) is obtained: 



u(x) = mg a(-x 1 -4' +b) 4 



L.V. 



(35) 
(36) 

(37) 
(38) 



(39) 



5. Experiment 1 

We apply controller (39) to the magnetic levitation system. We set x(0) = [-1.4, 0.o] T and 
<f = -2.0 [cm]. The controller is implemented by MATLAB/SIMULINK. The sampling 
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interval is lx 10 3 [sec] and control parameters are r = 8 and d = 1.25 x 10 4 , respectively. 
The time response of the controlled system is shown in Fig. 3. Although the velocity x 2 
vibrates due to sensor noise, the input constraint (26) is satisfied. However, the position x 1 

does not converge to zero (an offset error remains). Then, the actual magnetic levitation 

system is not asymptotically stabilized by the proposed controller (39). 

The biggest reason for the offset error is the lack of robustness with respect to u -If there 

exists a parameter uncertainty in g(x), the gravitational acceleration g Q is not completely 

canceled by the pre-feedback u (x) • Therefore, the proposed controller u(x) does not 

guarantee the robustness for the system (24) even if the stabilizing input u (x) guarantees 

the sector margin (1/ 2, oo) for the system 



*= f o (*) + 0(*K- 



(40) 



0.2 



-0.2 

-0.-0.4 

■% -0.6 

-0.8 

-1 

-1.2 

-1.4 







r^ - 


^-^ 




offset error 























4 6 

Time [sec] 



10 





4 6 

Time [sec] 



Fig. 3. Experimental result of controller (39) 



6. Adaptive Inverse Optimal Controller Design 

6.1 Robustness recovery via adaptive control 

To solve the problem stated in section 5, we propose a controller that guarantees a gain 
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margin for u . We apply an adaptive control technique to achieve a gain margin for input u . 
Before applying the adaptive controller, we rewrite the system (24) to 

*= f M+ fi( x )9o + 9( x ) u > (41) 

where fJx) is defined by (31) and f^x) = [0,-l] T • Additionally, to consider a gain margin 
for (41), we rewrite the system to 

x = f o (x) + Kf 1 (x)0 + Kg(x)u , (42) 

where k is an unknown constant and -= g / k is a constant unknown parameter. Note 

that the range of k , in which the origin of the system (42) is asymptotically stable, is a gain 
margin for input u . Furthermore, we consider the following input: 

u(x,0) = u c (x,0) + u' s (x,0), (43) 

where Q\- g / re and k is an estimate of k . We suppose that input u'(x,§) asymptotically 
stabilizes the system (40) and guarantees the gain margin (1/ 2,oo) . Let u (x, 0) be a gravity 
compensation input defined as follows: 

u c (x,0) = madi-x.-f+b) 4 . (44) 

Remark 1 In this section, we do not mention whether the input constraints exist or not. 

Then, we construct an adaptive law such that the input (43) stabilizes the system (42) and 
show the input (43) has a gain margin (1/ 2, oo) . 

In this section, we use an ACLF to construct an adaptive law. The following lemma is 
available for constructing an ACLF. 

Lemma 1 We consider system (42). Let V(x) be a CLF for system (41). Then, V(x) is an 
ACLF for system (42). 

Proof: If V(x) is an ACLF for system (42), V(x) is a CLF for the following system: 

x= f (x) + *f 1 (x)(^ + / — j + /^(x)u, (45) 

where y is a positive constant. Note that dV 1 80 = , the above system is rewritten to 

*= f o (x)^Kf 1 (x)0 + Kg(x)u. (46) 
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Sytem (46) is asymptotically stabilized by the input 

u(x) = Oi c (x) + u s (x), (47) 

where u (x) and u (x) are defined by (27) and (35) respectively. This implies all CLFs for 
system (41) are ACLFs for system (42). 

By Lemma 1, CLF (31) is applicable to an ACLF for system (42). 

Lemma 2 We consider system (42) and assume that an ACLF V(x) for (42) is obtained. Let 
V'(x, 6) be a function defined by 

V'(x,0) = V(x) + ^(0-0) 2 =V(x) + ^-e\ (48) 

where 1/ 2 < k < oo and 6 := 6 - 6 . Let the adaptive law 6 be 

e = YT(x) = Y d ^f 1 {x). (49) 

dx 

Then, V'(x, 6) is a Lyapunov function for the closed loop system of (42). 

Proof: Let the origin of system (42) be (x,§) = (0,0) • Then, V' is a positive definite function. 
Assume u is input (43) and note that 6 = -6 . Then, 

%\ -1 (50) 

= -f o (x)^(xK(x,^)<0. 
ox 

Since the input u'(x,3) has a gain margin (l/2,oo), V'(x, 6) is less than or equal to zero. 
Then V'(x, 6) is a Lyapunov function for the closed loop system of (42) and the origin 
(x,#) = (O,60 is stable. 

Remark 2 Lyapunov function (48) contains an unknown constant k . However, it does not 
become a problem because both input (43) and adaptive law (49) do not contain k . 

Lemma 3 We consider system (42) and assume that an ACLF V(x) for (42) is obtained. 
Then, if 1/ 2 < k < oo , x -^ 0(t — > oo) and § — > Q(t — > oo) are achieved by input (43) and 
adaptive law (49). 
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Proof: By Lemma 2, we can construct a Lyapunov function V'(x, 6) (47) for system (41). 
The input and the adaptive law are given by (42) and (48), respectively. Then, we obtain 
V f (x,6) < 0(x ^ 0) because the input u'(x,6) has a gain margin (l/2,oo). Let S be a set 
defined by 

S:={(x,6)\v'(x,6) = 0,xeR n ,6eR}, 

- - (51) 

= {(x,6)\x = 0,6eR}. 

We show that the largest invariant set contained in S consists of only a point (x, 6) = (0, 6) • 
Consider the following solution of (42) belonging to S : 

x(t) = 0,t>0. (52) 

Note that u' (0, 6) = , we obtain the following equation for (42): 

x=f o (O) + K{f 1 (O)0 + g(O)u(O,§)\ 

= *:{f 1 (O)0 + g(O)u c (x,0)\ ( 53 ) 

= ^(0-0)^0, 

where k ^ and f x (0)^0, we obtain 6 = 6. On the other hand, if x = and 6^6, we 
obtain x^0 by (50). Therefore, the largest invariant set contained in S is a set {(0,#)}. 

Finally, we obtain x -^ and 6 -^ 6 when t -^ oo by LaSalle's invariance principle [Khalil 
(2002)]. 

The following theorem is obtained by Lemmas 2 and 3. 

Theorem 4 We consider system (42), controller (43) and adaptive law (49). Then, the 
controller has a gain margin (l/2,oo) . 

6.2 Adaptive inverse optimal controller 

We calculate 6 of (49) by using CLF (31) as: 



6 = y[(r 2 + l)x 1 + rx 2 rx 1 + x 2 \ 
= -y(rx 1 +x 2 ). 



(54) 



Furthermore, taking into consideration the input constraint, we obtain the following 
controller: 
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u(x, 0) = u c (x, 0) + u' s (x, 0) = ma§{-x 1 - %" + b) 4 + u' s (x, 0), 



u' s (x,0) = - 



R 2 (x,0) 



~^LV,' 



(55) 
(56) 



R 2 (x,0) = 



(2 + q(x,0)\\L g V\\)\\L g V\\ 
2(P 2 +\P 2 \) + q(x,0)\\L g V\\ 



9 112 



q(x,0) 



(LY = 0), 



P 2 (x,0)- 



L f V 

To 



C(x,0)LV 



(57) 



(58) 



q(x,0) = dC(x,0), 



C(x,0) = 



5-\u c (x,0)\ 



(59) 
(60) 



where we use u (x) given by (35) as u' (x,3) • Then, note that the input constraint C(x) is 
rewritten to C(x, 6) given by (60). According to Lemma 2 and the result of [Nakamura et al. 
(2007)], we can show the input u' (x,3) minimizes the following cost function: 



j = ]«x,e )+ ^u-dt, 



(61) 



where 



KX,ff)- 



1 II l|2 

" \v\\ -L f V. 

9 1 1 2 / o 



2R 2 {x,0) 



(62) 



It is obvious that a gain margin (l/2,oo) is guaranteed for controller (55) at least in the 
neighborhood of the origin. 



7. Experiment 2 

In this section, we apply controller (55) to the magnetic levitation system and confirm its 
effectiveness by the experiment. To consider the input constraint, we employ the following 
adaptive law with projection instead of (54): 
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= 2g Q ,rx l + x 2 >0 

6 = Q,rx 1 + x 2 <0 . 

- y(rx l + x 2 ) otherwise 



(63) 



We set the adaptation gain ^ = 160 and the initial value of the estimate §(0) = 820 . The 
other experimental conditions and control parameters are the same as in section 5. The 
experimental result is shown in Fig. 5. Position x 1 converges to zero without any tuning of 

control parameters. The gain margin guaranteed by the adaptive law seems quite effective. 
We can observe that the input is larger than the non-adaptive controller (39), however, the 
input constraint is satisfied. The parameter estimate 6 also tends to converge to the true 
value 6 . As a result, the effectiveness of the proposed controller (55) is confirmed. 




Fig. 4. Experimental result of controller (55) 



8. Conclusion 

In this article, we proposed an adaptive inverse optimal controller for the magnetic 
levitation system. First, we designed an inverse optimal controller with a pre-feedback 
gravity compensator and applied it to the magnetic levitation system. However, this 
controller cannot guarantee any stability margin. We demonstrated that the controller did 
not work well (offset error remained) in the experiment. Hence, we proposed an improved 
controller via an adaptive control technique to guarantee the stability margin. Finally, we 



322 Adaptive Control 

confirmed the effectiveness of the proposed adaptive inverse optimal controller by the 
experiment. As a result, we achieved offset-free control performance. 
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1. Introduction 

In the unmanned ground vehicle (UGV) case, the estimation of a future position with a 
present one is one of the most important techniques (Madhavan & Schlenoff, 2004). 
Generally, the famous global positioning system (GPS) has been widely used for position 
tracking because of its good performance (Torrieri, 1984; Kim et al., 2006). However, there 
exist some defects. For example, it needs a separate receiver and it must have at least three 
available satellite signals. Moreover it is also vulnerable to the indoor case (Gleanson, 2006) 
or the reflected signal fading. 

There have been many researches to substitute or to assist the GPS. One of them is the 
method of using the time difference of arrival (TDoA) which needs no special equipment 
and can be operated in indoor multipath situation (Najar & Vidal, 2001). The TDoA means 
an arrival time difference of signals transmitted from a mobile station to each base station. It 
is the basic concept of estimation that the position of a mobile station can be obtained from 
the crossing point of hyperbolic curves which are derived from the definition of TDoA. 
Including some uncertainties, there have been several approaches to find the solution of 
TDoA based geolocation problem using the least square method, for example, Tayler series 
method (Xiong et al, 2003), Chan's method (Ho & Chan, 1993), and WLS method (Liu et al, 
2006). However in case of a moving source, it demands a huge amount of computational 
efforts each step, so it is required to use a method which demands less computational time. 
As a breakthrough to this problem, the application of EKF can be reasonable. 
The modeling errors happen in the procedure of linear approximation for system behaviors 
to track the moving source's position. The divergence caused from the modeling errors is a 
critical problem in Kalman filter applications (Julier & Uhlmann, 2004). The standard 
Kalman filter cannot ensure completely the error convergence because of the limited 
knowledge of the system's dynamical model and the measurement noise. In real 
circumstances, there are uncertainties in the system modeling and the noise description, and 
the assumptions on the statistics of disturbances could be restrictive since the availability of 
a precisely known model is very limited in many practical situations. In practical tracking 
filter designs, there exist model uncertainties which cannot be expressed by the linear state- 
space model. The linear model increases modeling errors since the actual mobile station 
moves in a non-linear process. Especially even with a little priori knowledge it is quite 
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valuable concerning the strategy. 

Hence, the compensation of model uncertainties is an important task in the navigation filter 
design. In modeling or formulating the mathematical equations, the possible prediction 
errors are approximated or assumed as a model uncertainty. The facts discussed above leads 
to unexpected deterioration of the filtering performance. To prevent the divergence problem 
due to modeling errors in the EKF approach, the adaptive filter algorithm can be one of the 
good strategies for estimating the state vector. This chapter suggests the adaptive fading 
Kalman filter (AFKF) (Levy, 1997; Xia et al, 1994) approach as a robust solution. The AFKF 
essentially employs suboptimal fading factors to improve the tracking capability. In AFKF 
method, the scaling factor is introduced to provide an improved state estimation. The 
traditional AFKF approach for determining the scaling factors mainly depends on the 
designer's experience or computer simulation using a heuristic searching plan. In order to 
resolve this defect, the fuzzy adaptive fading Kalman filter (FAFKF) is proposed and used as 
an adaptive geolocation algorithm. The application of fuzzy logic to adaptive Kalman 
filtering gains more interests. The fuzzy logic adaptive system is constructed so as to obtain 
the suitable scaling factors related to the time-varying changes in dynamics. In the FAFKF, 
the fuzzy logic adaptive system (FLAS) is used to adjust the scaling factor continuously so 
as to improve the Kalman filter performance. 

In this chapter, we also explain how to compose the FAFKF algorithm for TDoA based 
position tracking system. Through the comparison using the simulation results from the 
EKF and FAFKF solution under the model uncertainties, it shows the improved estimation 
performance with more accurate tracking capability. 

2. Geolocation with TDoA analytical methods 

When the mobile station (MS: the unknown position) sends signals to each base station (BS: 
the known position), there is a time difference because of the BS's isolated location from MS. 
The fundamental principle of position estimation is to use the intersection of hyperbolas 
according to the definition of TDoA as shown in Fig. 1. 
The problem of geolocation can be formulated as 

d. =|s-Z?.| 

d n = ct n = Ct- - Ct 

b. = col{x i , y. } , i = l, 2, 3, • • • , m 
s = col{x,y} 

where b. is the known position of z-th signal receiver (BS), s is the unknown position of 
signal source (MS), and c is the propagation speed of signal. In Eq. (1), d. means the 
distance between MS and z-th BS and t. is the time of signal arrival (ToA) (Schau & 
Robinson, 1987) from MS to z-th BS. Hence t a becomes the time difference of arrival 
(TDoA) which is the difference of ToA between t. (from MS to the z-th BS) and t x (from 
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-5 -4 -3 -2 -1 

Fig. 1. Geometric method using hyperbolas. 



MS to the first BS). The distance difference of d a results from the multiplication of TDoA 
and c. 

Generally it is possible to estimate the source location if the values of ToA could be 
provided exactly. However, it is required to be synchronized for all MS and BS's in this case. 
To find the TDoA of acknowledgement signal from MS to BS's, the time delay estimation 
can be used. As shown in Fig. 1, the estimation of geolocation can be obtained by solving the 
nonlinear hyperbolic equation from the relation of TDoA. If there are three BS's as in Fig. 1, 
we can draw three distinct hyperbolic curves using distance difference from TDoA signal. It 
is the principle of geometric method that the cross point becomes the position estimation of 
MS. 

To find the position estimation (s) of the unknown MS in an analytical method, let's rewrite 
the distance difference equation (1) as 



■ d 1 + d a , i = 2,3,--,m. 



(2) 



By squaring Eq. (2) with the relation of (d. ) = is -b.,s-b.) , the nonlinear equation for 
positional vector of s can be formulated as following. 



||s|| -IbJs + lM =\\s\\ 2 -2b[s + ||b 1 | + 2d 1 d il +(d il ) 2 , i = 2 r ~,m 



(3) 
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To represent the solution in linear matrix equality form, Eq. (3) can be simplified as 



\\b t I + (d a ) 2 = 2(h —b.,s) - 2d x d iX , i = 2, • • • , m 



(4) 



Using the distance from MS to the first BS, (d x ) 2 = (x - x x ) 2 + (y - y x ) 2 , and with b 1 as the 
origin of coordinates, i.e., b 1 = col{0, 0} , we can obtain the position estimation from the 
following two nonlinear constraints. 



1 



(-\\b l f+(d a ) 2 ) = -(b l ,s)-d l d il9 i = 2,3, 



(5) 



x + y —(d 1 ) =0 



To find the solution of s, Eq. (5) is rewritten in linear matrix equation. Now the source vector 
s can be acquired by solving the following MS geolocation problem. 



Gs = h + pd l 

(d;) 2 =(s,s) 



h = 



Nf-(<*2i) 2 



<d ml f 



= -(d-p«p) 



(6) 



d u 


,d = 


'Kb)' 


d ml _ 




_{K,b m )_ 



where G = [b 2 • • • b m ] and • means the Hadamar operator. 



3. Geolocation with model uncertainty 

This section describes the geolocation using the estimation filter in state-space. As stated in 
the section 2, the conventional analytical methods are focused on solving the nonlinear 
hyperbolic equations. In this section, we introduce the fuzzy adaptive fading Kalman filter 
to get the precision estimation for multiple model uncertainties. 



3.1 System modeling 

In the real case, TDoA signal can be distorted by the timing error due to non-line-of sight or 
by additive white Gaussian noise. To find the precision geolocation in real case, the system 

modeling must include the model uncertainty. Let t o be the ideal TDoA signal and At is 
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the distorted amount by external noises. The real value of TDoA is changed as t = t + At . 

If the real value of TDoA is used in Eq. (6), it becomes more complicated nonlinear equation 
and this complexity may cause huge computational efforts in the real-time process. As a 
breakthrough to this problem, the Kalman filter which needs relatively less computational 
time can be an alternative solution. 

Since the hyperbolic equation of TDoA is nonlinear, the extended Kalman Filter (EKF) can 
be used as a nonlinear state estimator. The basic algorithm of EKF is shown as in Fig. 2. 



Time update 


a) The state projection 




**=/&>»* 


,»*) 


b)The 


error covariance 


projection 




/T-4^-A T 


-* t Qtf 



correction 



^ 



Measurement update 



c) Kalman gain update 



K,=PrHl[H,PrHl+KRXr 



d) The error covariance update 



P K -[I~K&]P, 



e) The estimate update with measurement 



*k = K- £ tl*k-HK>*f*k)] 



\ 



prediction 



Fig. 2. Flow chart of extended Kalman filter. 

The first step is the time update in which it predicts the state of next steps from processing 
model and it compares the real measurement with the prediction measurement of s 
obtained by time update process. For TDoA based geolocation using extended Kalman filter, 
the discrete state-equation of the processing and measurement model for MS can be 
formulated as 



S k+ l= As k+ Bu k+ W k 



A = 



1 A 0" 




"0 0" 


1 A 



, B = 




1 







1 



(7) 
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where s(k) = [x y x y] , u k is the known velocity of moving MS, A is the time interval 

of sampling, w k is an additive white Gaussian noise (AWGN). 

From the definition of TDoA, the measurement model can be written as 



1 



M 



Ms k -b t 



10 
10 



\Ms k -b.\\) + v k 



(8) 



where v k is the measurement noise in AWGN. 

The output result of z k which is the TDoA signal provides the information of MS position. 

As an accurate geolocation method, the frequency difference (FDoA) resulted from Doppler 

shifts observation can be added in the state equation. However, to make the problem more 

simple, we consider only the TDoA signal as an output measurement in this section. 

Since the measurement model z k is nonlinear equation, the linear approximation using 

partial differential method should be done for the use of EKF. 



z k = z k-i+ H h s k+ V k 



H, 



ds h 



3v. 



(9) 



3.2 Geolocation using fuzzy adaptive fading Kalman filter (FAFKF) 

EKF is a very useful method for nonlinear state estimation. However, as EKF is based on the 
linearization of nonlinear system using partial differential method, the modeling errors can 
easily lead to the divergence problem. To solve this problem, an adaptive fading Kalman 
filter (AFKF) with a fading factor can be applied. The application of AFKF to geolocation 
estimation is given in the following mathematical expression. 

Basically, the fading factor X is added in the error covariance projection during the time 

update process. 



p-=X k A k P k _X+W k QX 



(10) 



where \ = diag(A [ , A 2 , • • • , A m ) . In normal case of X k -\,\\ means the general EKF. If the 

estimated value approaches to the steady-state value, the fading factor A becomes less 

than 1. If X k is greater than 1, the divergence could happen. This iterative process is called 
as the adaptive fading loop given as follows. 
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A k+] = max<M, 



tr[E k ] 

F k =c o~ R k ~ H kQk H l 

E k =H k A k P k A T k H T k 



C n 



Ml 

2 ' 



i+A 



(ii) 
k = o 



k>\ 



<l>k= z k-Zk> z k =h(s k9 v k ) 

where a is a scaling factor and tr[-] is the trace of a matrix. 

Moreover the measurement estimation of z k is predicted through the estimation of s k . 
That is, if we get more accurate s k , then the more accurate z k can be obtained. As the 
error of output measurement is within the s - neighborhood i.e., \\ z k - z k 1 1< s k and 
£ k - £ k _ x < , it is confirmed that the present estimation performance is guaranteed and the 

fading factor becomes X k+X < 1 . 

The fuzzy logic adaptive system (FLAS) offers an effective method when the problem is too 
complicated or hard to be analyzed in mathematical way. The procedure of general fuzzy 
system can be classified as three parts; fuzzification, fuzzy inference, and defuzzification. 
The first step of fuzzification is to make linguistic variables from inputs and outputs. The 
second step of fuzzy inference is to make rules using if-then expression. Finally the third step 
of defuzzification is to decide the degree of the output value. 

Using the scaling factor ( a ) as an output from FLAS, the fading factor in FAFKF is updated 
as A k+l = a - tr[F k ]/tr[E k ] . According to the following two degree of divergence (DoD) 
parameters from the innovation covariance matrix and the trace of innovation covariance 
matrix, it is possible to identify the changing degree in dynamics of MS. The first DoD 
parameter 8 is defined as the ratio of the trace of innovation covariance matrix at present 
state and the number of measurements used for estimating location. 

S = & (12) 

m 

where (j) k = \(j) x (f) 2 • • • (f) m ] , m is the number of measurements (number of TDoA signals). 
The second DoD parameter a is defined as the average of the absolute value of the 
measurement error (j) k . 



<T = -£l4l ( 13 ) 



m , =1 
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The fading factor A k updated through the adaptive fading loop is used to change the error 



covariance P k . 
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Fig. 3. Flow chart of the fuzzy adaptive fading Kalman filter process. 

Fig. 3 shows how the FAFKF works for TDoA geolocation problem. As a first step in the 
process of FAFKF, the two DoD parameters ( S, a ) are obtained from measurement 

difference between the real value ( z k ) and the estimation result ( z k ). These DoD 

parameters are used as the inputs for the fuzzy system. Finally the FLAS is employed for 
determining the scaling factor a from the innovation information. According to the scaling 
factor a, the estimation accuracy is determined. Using the fuzzy logic system, we can 
adjust the fading factors adaptively to improve the estimation performance. 



4. Simulation results 

The basic circumstance to be used in the simulation is shown in Fig. 4. There are two BS r s 
and the signal source of MS is supposed to move at a constant speed but changes its 
direction every 2.5 sec. In Fig. 4, the dotted line is an ideal path of MS with no external forces. 
The solid line is the real path which is affected by the multiple noises such as the 
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measurement noise v k and the process noise w k . The thick solid line is the path of MS 

estimated by the standard EKF with no adaptive method. Fig. 4 shows that the performance 
of EKF is restricted especially when the MS changes the direction. The accumulated position 
error is increased as the MS changes its direction frequently. 

To prove the effectiveness of the adaptive fading factor in TDoA gelocation, the simulation 
parameters are set close to the real values. Table 1 shows the simulation parameters. 



Ideal Path 
-Real Path 
■EKF 



BS2(12,5) 




BS1(0,0) 



2 4 6 S 10 12 

Fig. 4. Simulation circumstance for MS 

The FLAS consists of the following 9 rules and is represented in the following if-then form. 
The membership functions of input fuzzy variable (DoD parameters: 8 and a ) and 
output (scaling factor: a ) are shown in Fig. 5. 

i. if 8 is n (negative) and a is n, then a is nb (negative big), 
ii. if 8 is z (zero) and a is n, then a is ns (negative small). 





EKF 


AFKF 


FAFKF 


Speed 


Constant 


Constant 


Constant. 


D (time interval) 


0.1 sec 


0.1 sec 


0.1 sec 


a (scaling factor) 


None 


0.12 


FLAS output 


X (fading factor) 


None 


Constant 


Fuzzy based 



Table 1. Parameters for the TDoA geolocation simulation 
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0.005 0.01 



0.015 0.02 0.025 

(a) input #1 (t?) 




0.1 0.15 

(c) output (a) 



Fig. 5. Membership functions in FLAS 



in. 

iv. 

v. 

vi. 
vii. 
viii. 

ix. 



if 8 is p (positive) and 

if 8 is n and a is z, 

if 8 is z and <j is z, 

if £ is p and <j is z, 

if 8 is n and cr is p, 

if £ is z and <j is p, 

if £ is p and cr is p, 



cr is n, then or is z. 

then or is ns. 

then a is z. 

then cir is ps (positive small), 

then a is z. 

then a is ps. 

then a is pb (positive big). 



As the DoD parameter ( 8 ) and the averaged magnitude ( cr ) of (j)(k) change within 
0.003-0.007 and 0.03-0.1 respectively, we define those range as zero for 8 and cr . The 
output of the scaling factor ( a ) is determined as 0.12 following that of AFKF in the 
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associated range. Other values can be determined from experiential way. The simulation 
result of FLAS and adaptive fading loop is given in Fig. 6. 

Fig. 6 shows the change of the scaling factor a k and the fading factor X k . The values of 
a k and A k change very steeply to correct the position error from the beginning and the 
estimate s k gets close to the real value within s -neighborhood about after 1 sec since the 
fading factor becomes small. 



0.2 



B 0.15 - 



0.05 



4 5 6 

Tirne(sec) 

(a) Change of Scaling factor 



■Constant tx 
-FLAS ex 




10 



6 

5 

B 4 

o 

m - 1 

~° -I 
•B ^ 

*1 





1 

+ 


i 


I I I I I 


i i 


■■■■+■■■■ Constant a 


- 






— ■ — FLAS a 


- 


: : ; 




+ 


+ 

+ 


+ + 




+ "■ 


'♦A 


i 


1 


l l l l l 


1 1 



12 3 4 5 6 

Time(sec) 

(b) Change of Fading factor 
Fig. 6. Change of scaling factor ( a ) and fading factor (A ). 



10 



Fig. 7 shows the performance of the proposed geolocation algorithm (FAFKF) through the 
comparison with AFKF and EKF. The performance is measured in terms of the norm of 
positioning error, i.e. \\s k —sA . As shown in Fig. 7, the positioning error of FAFKF is much 

smaller than that of EKF. It can be confirmed that the difference of position error between 
EKF and FAFKF is increased as the MS changes its direction more frequently. It means that 
the position estimation with FAFKF is tracking more precisely to the real value of s k than 
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Fig. 7. Comparison of error performance 



AFKF or the standard EKF. 

Fig. 8 indicates the path estimation performance of the proposed geolocation algorithm 

through the comparison with AFKF and EKF under the situation of Fig. 4. As the adaptive 

fading factor takes the sub-optimal value at each iteration, the error covariance has been 

updated and is used to modify the Kalman filter gain adaptively. As shown in Fig. 8, the 

trajectory estimation using FAFKF is close to the real value under noise added real 

circumstance. 



5. Conclusion 

In this chapter, we introduced TDoA geolocation algorithm to reduce the position 
estimation error. To be more similar to real circumstance, the MS is supposed to change its 
direction periodically. The standard EKF which solves a huge computational problem of 
TDoA based geolocaion can estimate the location of source through the linearization of 
nonlinear measurement equation. However, the linearization from partial differentiation 
causes a divergence problem which restricts the performance of the EKF. 
To solve this problem, we applied FAFKF algorithm which changes the error covariance 
using an adaptive fading factor ( A ) from fuzzy logic. The scaling factor a which is used 
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to update the fading factor has been decided by the fuzzy logic to minimize the estimation 
error. Through the simulation results, it is confirmed that the trajectory estimation using 
FAFKF follows the real one more precisely than EKF. The positioning error from FAFKF is 
less than that performed by AFKF. 
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1. Introduction 

Adaptive control of highly uncertain nonlinear dynamic systems has been an important 
research area in the past decades, and in the meantime neural networks control has found 
extensive application for a wide variety of areas and has attracted the attention of many 
control researches due to its strong approximation capability. Many significant results on 
these topics have been published in the literatures (Lewis et aL, 1996 ; Yu & Li 2002; 
Yesidirek & Lewis 1995). It is proved to be successful that neural networks are used in 
adaptive control. However, most of these works are applicable for a kind of affine systems 
which can be linearly parameterized. Little has been found for the design of specific 
controllers for the nonlinear systems, which are implicit functions with respect to control 
input. We can find in literatures available there are mainly the results of Calise et al. (Calise 
& Hovakimyan 2001) and Ge et al. (Ge et al. 1997). Calise et al. removed the affine in control 
restriction by developing a dynamic inversion based control architecture with linearly 
parameterized neural networks in the feedback path to compensate for the inversion error 
introduced by an approximate inverse. However, the proposed scheme does not relate to the 
properties of the functions, therefore, the special properties are not used in design. Ge, S.S. 
et al., proposed the control schemes for a class of non-affine dynamic systems, using mean 
value theorem, separate control signals from controlled plant functions, and apply neural 
networks to approximate the control signal, therefore, obtain an adaptive control scheme. 
Furthermore, when controlling large-scale and highly nonlinear systems, the presupposition 
of centrality is violated due to either due to problems in data gathering when is spread out 
or due to the lack of accurate mathematical models. To avoid the difficulties, the 
decentralized control architecture has been tried in controller design. Decentralized control 
systems often also arise from various complex situations where there exist physical 
limitations on information exchange among several subsystems for which there is 
insufficient capability to have a single central controller. Moreover, difficulty and 
uncertainty in, measuring parameter values within a large-scale system may call for 
adaptive techniques. Since these restrictions encompass a large group of applications, a 
variety of decentralized adaptive techniques have been developed (Ioannou 1986). 
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Earlier literature on the decentralized control methods were focused on control of large- 
scale linear systems. The pioneer work by Siljak (Siljak 1991) presents stability theorems of 
interconnected linear systems based on the structure information only. Many works 
consider subsystems which are linear in a set of unknown parameters (Ioannou 1986 ; Fu 
1992 ; Sheikholeslam & Desor 1993 ; Wen 1994 ; Tang et al. 2000), and these results were 
focused on systems with first order interconnections. When the subsystems has nonlinear 
dynamics or the interconnected is entered in a nonlinear fashion, the analysis and design 
problem becomes even challenging. 

The use of neural networks' learning ability avoids complex mathematical analysis in 
solving control problems when plant dynamics are complex and highly nonlinear, which is 
a distinct advantage over traditional control methods. As an alternative, intensive research 
has been carried out on neural networks control of unknown nonlinear systems. This 
motivates some researches on combining neural networks with adaptive control techniques 
to develop decentralized control approaches for uncertain nonlinear systems with 
restrictions on interconnections. For example, in (Spooner & Passino 1999), two 
decentralized adaptive control schemes for uncertain nonlinear systems with radial basis 
neural networks are proposed, which a direct adaptive approach approximates unknown 
control laws required to stabilize each subsystem, while an indirect approach is provided 
which identifies the isolated subsystem dynamics to produce a stabilizing controller. For a 
class of large scale affine nonlinear systems with strong interconnections, two neural 
networks are used to approximate the unknown subsystems and strong interconnections, 
respectively (Huang & Tan 2003), and Huang & Tan (Huang & Tan 2006) introduce a 
decomposition structure to obtain the solution to the problem of decentralized adaptive 
tracking control a class of affine nonlinear systems with strong interconnections. Apparently, 
most of these results are likewise applicable for affine systems described as above. For the 
decentralized control research of non-affine nonlinear systems, many results can be found 
from available literatures. Nardi et al. (Nardi & Hovakimyan 2006) extend the results in 
Calise et al. (Calise & Hovakimyan 2001) to non-affine nonlinear dynamical systems with 
first order interconnections. Huang (Huang & Tan 2005) apply the results in (Ge & Huang 
1999) to a class of non-affine nonlinear systems with strong interconnections. 
Inspired by the above researches, in this chapter, we propose a novel adaptive control 
scheme for non-affine nonlinear dynamic systems. Although the class of nonlinear plant is 
the same as that of Ge et al. (Ge et al. 1997), utilizing their nice reversibility, and invoking 
the concept of pseudo-control and inverse function theorem, we find the equitation of error 
dynamics to design adaptation laws. Using the property of approximation of two-layer 
neural networks (NN), the control algorithm is gained. Then, the controlled plants are 
extended to large-scale decentralized nonlinear systems, which the subsystems are 
composed of the class of non-affine nonlinear functions. Two schemes are proposed, 
respectively. The first scheme designs a RBFN-based (radial basis function neural networks) 
adaptive control scheme with the assumption which the interconnections between 
subsystems in entire system are bounded linearly by the norms of the tracking filtered error. 
In the scheme, unlike most of other approaches in available literatures, the weight of BBFN 
and center and width of Gaussian function are tuned adaptively. In another scheme, the 
interconnection is assumed as stronger nonlinear function. Moreover, in the former, in every 
subsystem, a RBFN is adopted which is used to approximate unknown function, and in the 
latter, in every subsystem, two RBFNs are respectively utilized to approximate unknown 



Adaptive Control for a Class of Non-affine Nonlinear Systems via Neural Networks 339 



function and uncertain strong interconnection function. For those complicated large-scale 
decentralized dynamic systems, in order to decrease discontinuous factors and make 
systems run smooth, unlike most of control schemes, the hyperbolic tangent functions are 
quoted in the design of robust control terms, instead of sign function. Otherwise, the citation 
of the smooth function is necessary to satisfy the condition of those theorems. 
The rest of the paper is organized as follows. Section 2 gives the normal form of a class of 
non-affine nonlinear systems. Section 3 proposes a novel adaptive control algorithm, which 
is strictly derived from some mathematical and Lyapunov stability theories, and the 
effectiveness of the scheme is validated through simulation. Extending the above-mentioned 
result, Section 4 discusses two schemes of decentralized adaptive neural network control for 
the class of large-scale nonlinear systems with linear function interconnections and 
nonlinear function interconnections, respectively. Finally, the Section 5 is concluding 
remarks. 



2. Problem Statement 

We consider a general analytic system 

fc = g&u), ?e*\ usR (1) 

\y = h{Q, ysR. 

where g(v) i s a smooth vector fields and h(-) is a scalar function. In practice, many 
physical systems such as chemical reactions, PH neutralization and distillation columns are 
inherently nonlinear, whose input variables may enter in the systems nonlinearly as 
described by the above general form (Ge et al. 1998). Then, the Lie derivative (Tsinias & 
Kalouptsidis 1983) of h(Q with respect to g(^,w) is a scalar function defined 

by L h = [dh(Q/dQg(^,u) . Repeated Lie derivatives can be defined recursively 
as E h — L (E~ l h\ for i = l,2-- . The system (1) is said to have relative degree CC 
at (£ ,w ) , if there exists a smallest positive integer CC such 
thatdll g h/du = ,dL a g h/du * 0, / = l,--,a-l. 

Let £X d R n and Cl u d Rbe compact subsets containing ^ Q andW , respectively. System 
(1) is said to have a strong relative degree CC in a compact seiD = £X x Cl u , if it has relative 
degree CC at every point (£ ,w ) e D . Therefore, system (1) is feedback linearizable and the 
mapping O(0 = [^(0,^(0,--^ I (0] / with ( /) j (Q = L^% j = l,2, — a has a Jacobian 
matrix which is nonsingular for all x g 0(Q , system (1) can be transformed into a normal 
form 
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; ( 2 ) 

x n =f(x,u) 

y = x { 

where f(x, u) = L n h and x = O" 1 (Q with X = [Xj , X 2 , • • • , X n ] . Define the domain of 
normal system (2) as£) Q l(x,w)|x G 0(Q^);w gQ). 

3. Adaptive Control for a Class of Non-affine Nonlinear Systems via Two- 
Layer Neural Networks 

Now we consider the n — th order nonlinear systems of the described form as (2). For the 

considered systems in the chapter, we may make the following assumptions. 

Assumption 1. df(x, u)/du^0 for all (x, u) G Q X R . 

Assumption 2. /(•) : R n+l — » R , is an unknown continuous function and /"(x, w) a smooth 

function with respect to control input U . 

The control objective is: determine a control law, force the output, y , to follow a given 

desired output, X d with an acceptable accuracy, while all signals involved must be 

bounded. 

Assumption 3. The desired signals x d (t) = [y d ,y ( ^ ,"-,y ( d n ~ l) ], and X d =[x d ,y d ] T are 

bounded, with Xj < X d , X d a known positive constant. 
Define the tracking error vector as 

e = X - X d , (3) 

and a filtered tracking error as 

T = [A T l]e, (4) 

with A a gain parameter vector selected so that e(t) — > as r — » 0. Differentiating (4), the 
filtered tracking error can be written as 

T = x„- X y+[0 A r ]e. (5) 

Define a continuous function 
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S = -kr + x™-[0 A T ]e. (6) 

where k is a positive constant. We know df(x^u)/du^0 (Assumption 1), thus, 
d[f(x,u)- S]/du ^ . Considering the fact that dS/du = , we invoke the implicit 
function theorem (Lang 1983), there exists a continuous ideal control input U in a 
neighborhood of (x, u) G Q X R , such that /(x, W* ) - £ = , i.e. £ = f(x,U*) holds. 
^ = /* (x, U ) may represent ideal control inverse. 
Adding and subtracting 8 to the right-hand side of X n = f(x, u) of (2), one obtains 

x n =/(x,w)-^-A:r + x^ ) -[0 A T ]e, (7) 

and yields 

f = -kr + /(x, w) - 8. (8) 

Considering the following state dependent transformation {// = X n , where ^ is commonly 

referred to as the pseudo-control (Calise & Hovakimyan 2001). Apparently, the pseudo- 
control is not a function of the control U but rather a state dependent operator. 

Then, dy//du = , from Assumption 1, df(x,u)/du ^ thus [y/ - f(x,u)]/du ^ . 
With the implicit function theorem, for every (x, u) E Q X R , there exists a implicit 
function such that l// - f(x, u) = holds, i.e. \j/ = f(x, u) . Therefore, we have 

y/ = f(x,u). (9) 

Furthermore, using inverse function theorem, with the fact that \\j/ — f(x,u)]/du ^ 
and f(x,u) is a smooth with respect to control input, U , then, f(x,u) defines a local 
diffeomorphism (Slotine & Li 1991), such that, for a neighborhood of U , there exists a 
smooth inverse function and u = f (x, y/) holds. If the inverse is available, the control 
problem is easy. But this inverse is not known, we can generally use some techniques, such 
as neural networks, to approximate it. Hence, we can obtain an estimated function, 

U = f~ (x, y/) . This result in the following equation holding: 

y/ = f(x,u), (10) 

where \j/ may be referred to as approximation pseudo-control input which represents 
actual dynamic approximation inverse. 
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Remark 1. According to the above-mentioned conditions, when one designs the pseudo- 
control signal, \j/ , must be a smooth function. Therefore, in order to satisfy the condition, 
we adopt hyperbolic tangent function, instead of sign function in design of input. This also 
makes control signal tend smooth and system run easier. The hyperbolic tangent function 
has a good property as follows (Polycarpou 1996) : 

< I/7I - 7j tanh(— ) < qa , (11) 

a 

with £- = 0.2785, <% any positive constant. Moreover, theoretically, \j/ is approximation 
inverse, generally a nonlinear function, but it must be bounded and play a dynamic 
approximation role and make system stable. Hence, it represents actual dynamic 
approximation inverse. 
Based on the above conditions, in order to control the system and make it be stable, we 

design the approximation pseudo-control input y/ as follows: 

ij> = f(x,u*) + u ad +v r , (12) 

where U ad is output of a neural network controller, which adopts a two-layer neural 

network, V r is robustifying control term designed in stability analysis. 

Adding and subtracting y/ to the right-hand side of (8), with 8 = f(x, U ) , we have 

i = -k T + f(x, u) + y/- f(x, u)-u ad -v r -S 

( 13 ) 
= -kr + A(x, u, u ) + y/ -S -u ad -v r , 

where A(x, U,U ) = f(x, u) — f(x, U ) is error between nonlinear function and its ideal 
control function, we can use the neural network to approximate it. 

3.1 Neural network-based approximation 

A two-layer NN consists of two layers of tunable weights, a hidden layer and an output 
layer. Given a £ > , there exists a set of bounded weights M and N such that the 

nonlinear error A e C(Q) , with Q compact subset of R n , can be approximated by a two- 
layer neural network, i.e. 

k = M T a{N T x nn ) + s{x nn ), (14) 

with X nn - [1, X d , e , y/] input vector of NN. 
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Assumption 4. The approximation error £ is bounded as follows: 

\s\<S N , (15) 

where 6 N > is an unknown constant. 

Let M and TV be the estimates respectively of M and TV . Based on these estimates, let 
U ad be the output of the NN 

u ad =M T a(N T x nn ). (16) 

Define M=M-M and N = N- N , where we use notations: Z = diag[M , N] , 

Z - diag[M,N] , Z — diag[M , N] for convenience. Then, the following inequality 
holds: 

^(Z r Z)<|zy|z| F -|z|^. (17) 

The Taylor series expansion of (j(N X nn ) for a given X nn can be written as: 

^N T x nn ) = a(N T x nn ) + <r'(N T x m )N T x m + 0(N T x nn f , (18) 

with & '= <j{N X nn ) and <J denoting its Jacobian, 0{N X ) the term of order two. In 

the following, we use notations: (J '= c(N X nn ) , G '= <j{N X nn ) . 

With the procedure as Appendix A, the approximation error of function can be written as 

M T a(N T xJ-M T a(N T xJ = M T (a-a'N T x nn ) + M T a'N T x nn +cu, (19) 
and the disturbance term CO can be bounded as 

|»| < |tf| F |xJl/ r <7'| F +|M||d-W r x„„| + |M| i , (20) 

where the subscript "F" denotes Frobenius norm, and the subscript "1" the 1-norm. 
Redefine this bound as 

U<pJ m (M,N,x,J, (21) 
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xM T a' 



+ 



where p a = max{|M|,|7V| F ,\\M\[} and 3 a 
p„ is an unknown coefficient, whereas i9 is a known function. 

i CO 'CO 



<r'N T x„ 



+ 1 . Notice that 



3.2 Parameters update law and stability analysis 

Substituting (14) and (16) into (13), we have 

t = -kr + M T a(N T x m ) - M T a<N T x m ) + y>-v r -S + s{x m ). 
Using(19), the above equation can become 

i = -kr + M T (a- <j'N T x nn ) + M T a'N T x m +y> -S-v r +co + s. 



(22) 



(23) 



Theorem 1. Consider the nonlinear system represented by Eq. (2) and let Assumption 1-4 
hold. If choose the approximation pseudo-control input y/ as Eq.(12), use the following 
adaptation laws and robust control law 



M = F[(a-a'Nx m )T-k 1 M\r\], 
N = R[x nn M T a'T-k l N\r\^, 



^ = Hr(5 ffl +l)tanh 
v r =-^+l)tanh 



a 
a 



-X<j> 



(24) 



where F = F T > 0, R = R T > are any constant matrices, A^ > and y > are scalar 
design parameters, (ft is the estimated value of the uncertain disturbance term 

(/) — max(/7^ , S N ) , defining (j) = (j) — with (ft error of (ft , then, guarantee that all signals 

in the system are uniformly bounded and that the tracking error converges to a 

neighborhood of the origin. 

Proof. Consider the following positive define Lyapunov function candidate as 



L = - t 2 + - tr(M T F~ l M) + - tr(N T R~ l N) + - y' 1 ^ 2 



The time derivative of the above equation is given by 

L = tt + tr(M T F~ l M) + tr(N T R~ l N) + y~^ 



(25) 



(26) 
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Substituting (23) and the anterior two terms of (24) into (26), after some straightforward 
manipulations, we obtain 



(27) 



L = -kr 2 + t[M t (a - &'N T x m ) + M T a'N T x m + (y> - 8) - v r + co + s] 
+ tr(M T F~ l M) + tr(N T RT l N) + x _1 # 
= -kr 2 + r(y> -8)- rv r + t(co + s) + y ~ l </></> + k x \c\ tr(Z T Z). 
< -kr 2 + r(y> -8)- rv r + Id #(&„ + 1) + f _1 # + k x Id fr-(Z r Z). 



With (4), (6), (12), (16) and the last two equations of (24), the approximation error between 
actual approximation inverse and ideal control inverse is bounded by 



\y/ - S\ < c x + c 2 Id + c 3 \\z\\ , 



(28) 



where C x , C 2 , C 3 are positive constants. 

Using (11) and the last two terms of (24), we obtain 



L < -kr 2 + r(y> -8)- T0(3 a + 1) tanh 



<^+l) 



+ 1^(^+1)-^ r(^+l) tanh 



a 
a 



-X$\ + k x \r\tr{Z T Z) ( 2 9) 



< -kr 2 + r(y> -8) + g</>a + X<JhJ> + k x \r\ tr(Z T Z) 



Applying (17),(28) , and^ < 
inequality 



, after completing square, we have the following 



L<-(k-c 2 )\r\ 2 +D x \t\ + D 2 
1 



(30) 



k c i 

where/} = C X +MZ M + ~ L )\D 2 = - l(j) 2 +gfa • 
4 k x ~ 4 



Let D 3 = ^D x + 4D 2 (k-C 2 )+D x , thus, as long as |r | > D 3 /[2(k - C 2 )] , and k>C 2 , 
then Z < holds. 



Now define 
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Q, = {fl \$\<^) 9 Q z =\z 



||z|| f <1(^z m+ c 3 )^ Q r =\r 



rk— !— Ak ( 31 ) 



2(Jt-c 2 ) 



Since Z M ,^,^,1)^ Z) 2 , Z) 3 , C 2 , C 3 are positive constants, as long as A: is chosen to be big 
enough, such that k > C 2 holds, we conclude that Q , , Q z and Q r are compact sets. 

Hence L is negative outside these compacts set. According to a standard Lyapunov 
theorem, this demonstrates that <ft,Z and T are bounded and will converge 
to Q , , Q z and Q r , respectively. Furthermore, this implies e is bounded and will converge 
to a neighborhood of the origin and all signals in the system are uniformly bounded. 

3.3 Simulation Study 

In order to validate the performance of the proposed neural network-based adaptive control 
scheme, we consider a nonlinear plant, which described by the differential equation 

2 2 3 2 2 ( 32 ) 

x 2 = -co x x - 0.02(a) + x x )x 2 + u + (x x + x 2 )<j(u) + tanh(0.2w) + d 

where CO = Aft , <j(u) = (1 -e~ u )/(\ + e~ u ) and J = 0.2 . The desired trajectory 
x d =0.1;r[sin(2£)-cos(7)]. 

To show the effectiveness of the proposed method, two controllers are studied for 
comparison. A fixed-gain PD control law is first used as Polycarpou, (Polycarpou 1996). 
Then, the adaptive controller based on NN proposed is applied to the system. 

Input vector of neural network is x nn = [1, X d , e ,lj/], and number of hidden layer nodes 25. 
The initial weight of neural network isM(O) = (0), A^(0) = (0) . The initial condition of 
controlled plant is x(0) = [0.1,0.2] . The other parameters are chosen as follows: 
^ = 0.01, ^ = 0. 1, A = 0.01, or = 10 , A = 2,F = 8/ M , R = 5I N , with I M J N corresponding 
identity matrices. 

Fig.l, 2, and 3 show the results of comparisons, the PD controller and the adaptive controller 
based on NN proposed, of tracking errors, output tracking and control input, respectively. 
These results indicate that the adaptive controller based on NN proposed presents better 
control performance than that of the PD controller. Fig.4 depicts the results of output of NN, 

norm values otM,N , respectively, to illustrate the boundedness of the estimates of 

M,N and the control role of NN. From the results as figures, it can be seen that the 
learning rate of neural network is rapid, and tracks objective in less than 2 seconds. 
Moreover, as desired, all signals in system, including control signal, tend to be smooth. 
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Fig. 1. Tracking errors: PD(dot) and NN(solid). 
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Fig. 2. Output tracking: desired (dash), NN (solid) and PD(dot). 
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Fig. 3. Control input: PD (dash), NN(solid) 
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4. Decentralized Adaptive Neural Network Control of a Class of Large-Scale 
Nonlinear Systems with linear function interconnections 

In the section, the above proposed scheme is extended to large-scale decentralized nonlinear 
systems, which the subsystems are composed of the class of the above-mentioned non-affine 
nonlinear functions. Two schemes are proposed, respectively. The first scheme designs a 
RBFN-based adaptive control scheme with the assumption which the interconnections 
between subsystems in entire system are bounded linearly by the norms of the tracking 
filtered error. In another scheme, the interconnection is assumed as stronger nonlinear 
function. 

We consider the differential equations in the following form described, and assume the 
large-scale system is composed of the nonlinear subsystems: 

x n ~ x a 



X i2 ~ X i3 



(33) 



X il t ~ Ji \ X i\ •> X i2 •> " ' ' X ili -> U i)^~ Si \ X \ ? X 2 ? " ' ' X n ) 

U- = x a 

where X. E R { is the state vector, X. = [x n , X i2 , * * * , X n ] , U f G R is the input and 

y. E R is the output of the i — th subsystem. 

f t (x. , U i ) \ R -^ R is an unknown continuous function and implicit and smooth 

function with respect to control input U . . 

Assumption 5. df t (x. , U ( ) / du . ^ for all (x. , U . ) E Q . X R . 

g. (Xj , X 2 , • • • , X n ) is the interconnection term. In according to the distinctness of the 
interconnection term, two schemes are respectively designed in the following. 

4.1 RBFN-based decentralized adaptive control for the class of large-scale nonlinear 
systems with linear function interconnections 

Assumption 6. The interconnection effect is bounded by the following function: 

n 

|g.(x 1? x 2? --- ? xj|<^^|^| ( 34 ) 

7=1 

where /.. are unknown coefficients, T . is a filtered tracking error to be defined shortly . 
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The control objective is: determine a control law, force the output, y. , to follow a given 

desired output, X di , with an acceptable accuracy, while all signals involved must be 
bounded. 

Define the desired trajectory vector x di = [y di ,y di ,--, J^ -1 f and X di = [y di , y di , • • • , y% } ] , 

tracking error £. = X i — X di = [e n , e i2 , * * * , 6 tl ] , thus, the filter tracking error can be 
written as 

T . = [Af l]e . = k iX e t + k i2 e t + • • • + k^e^ + ^ -1) , (35) 

where the coefficients are chosen such that the polynomial k n + K i 2 S H h A:. / ^ ' 

+^ (/ ' _1) is Hurwitz. 

Assumption 7. The desired signal X di (f) is bounded, so that Of J < Jf^. , where X di '\s a 

known constant. 

For an isolated subsystem, without interconnection function, by differentiating (35), the 

filtered tracking error can be rewritten as 

i t = x«r x * )+ W \ T ~\ e i = fi( x i> u i) +Y <n ( 36 ) 

with 7, =-*«>+[<> A/>,. 

Define a continuous function 



5 i= -k iTi -Y di (37) 

where A:, is a positive constant. With Assumption 5, we know df{x n U i )/du i ^ , 

thus, 9[y(jc. , W. ) — S t ]/du. ^0 . Considering the fact that dSjdu. = , we invoke the 

implicit function theorem, there exists a continuous ideal control input W. in a 

neighborhood of (x. , M f ) e Q f x i? , such that /(x. , l^.* ) - ^. = , i.e. S t = f t (x f , U . ) holds. 

^ = // (X ? w * ) represents ideal control inverse. 

Adding and subtracting ^. to the right-hand side of X u . = f t (x. , U t ) + g" . of (33), one 

obtains 

*,/,. = ft (*,■ »",-) + g, - $ - *,■*■,■ - ^ / (38) 

and yields 
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*i = - k < T < + ft 0, > u i ) + St - 5 i ■ (39) 

In the same the above-discussed manner as equations (9) -(10) , we can obtain the following 
equation: 

¥i=f i (x i ,u i ). (40) 

Based on the above conditions, in order to control the system and make it be stable, we 
design the approximation pseudo-control input \j/ i as follows: 

V> i =-k i T i -Y di +u ci +v ri , (41) 

where U ci is output of a neural network controller, which adopts a RBFN, V ri is 
robustifying control term designed in stability analysis. 

Adding and subtracting \j/ i to the right-hand side of (39), with 5 t = —k i T i —Y di = f t (x i9 u*) , 
we have 

f. = -k i r i + A . (x. , u t , u* ) - u ci + y/ t - 8 t - v ri + g. , (42) 

where A . (x. , W^. , U t ) — /J (^. , W . ) — /J (x ? . , W ; . ) is error between nonlinear function and its 
ideal control function, we can use the RBFN to approximate it. 

4.1.1 Neural network-based approximation 

Given a multi-input-single-output RBFN, let fl Xi and Hi u be node number of input layer and 
hidden layer, respectively. The active function used in the RBFN is Gaussian 
function, S t (x) = exp[-0.5(|z. - ju lk \f ) / a\ ] , / = 1, • • •, n x . , k = 1, • • •, m u where z. e R" liXl is input 
vector of the RBFN, ju t e R niiXmii and cr. e i? OTl xl are the center matrix and the width vector. 
Based on the approximation property of RBFN, A (x , U. , U. ) can be written as 

A . (x,. , u, , U* ) = W i r S i (z,. , ft, , a, ) + 5,. (z. ) , (43) 

where ^(z) is approximation error of RBFN, W t E i? mi;X . 

Assumption 8. The approximation error 8{x nn ) is bounded by 6*. < 6* M , with6* M > is 

an unknown constant. 

The input of RBFN is chosen as z. — [x r , T. , y/. ] T . Moreover, output of RBFN is designed as 
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Uj^Sfa, &,&,). (44) 

Define W i , jU f , C i as estimates of ideal W i9 JU i9 C i , which are given by the RBFN tuning 

algorithms. 

Assumption 9. The ideal values of W i9 jU n C i satisfy 

IM^> NL-^/w. INI^m- (45) 

where W iM , /U iM ,<J iM are positive constants. • and • denote Frobenius norm and 2- 
norm, respectively. Define their estimation errors as 

W t =W t -W i9 #=#-#> CJ,=C7,-C7 r (46) 

Using the notations: Z. = diag\W t , //. , O". ] , Z. = diag\W i , //. , cr. ] , Z. = diag\W i , //. , <r. ] for 

convenience. 

The Taylor series expansion for a given jU t and C i is 

Sfa,^^) = Sfatfiv&J + S'^fit +S , (7i a i +0{ju i ,d i f (47) 

where ^. D dS k {z n /!.,&.) /d/l n S^U dS k {z i ,jd i ,d l )jda i evaluated at //.=//. , 
O". = (J. , 0(//. , (7. ) denotes the terms of order two. We use notations: S '= S . (z. , // , <7 ), 

5. := 5,. (z. , //. , (7,. ) , 5. := S. (z. , //. , a. ) . 

Following the procedure in Appendix B, it can be shown that the following operation. The 
function approximation error can be written as 

The disturbance term co i (t) is given by 

a>, (t) = W, T {S t -§,) + Wj (5;. ft, + S'^ ) - Wj (S' MiMi + S>, ) (49) 

Then, the upper bound of 0) t (t) can be written as 

k(o| * W(|ftA|L +|c^|L)+|^^;|L hi +Ws'J kHW, < Pai & ai m 

i i ii ii ii i" n/< ii n/< i~ p ii lu p ii ii ii in 
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where p ai = max(|WL|/J ,|k||,2|^|l ), d ai = \\S'M +p' al & l \\ +Ws'J +\\W l T S'J+l , 



int~x || F 



with • I 1 norm. Notice that n is an unknown coefficient, whereas 3 . is a known 

II 111 A^<yi ' <yz 

function. 



4.1.2 Controller design and stability analysis 

Substituting (43) and (44) into (42), we have 

* t = -to + w, T s, - w, T s, + y>, - S t - v ri + gt + s, (z, ) , 

using (48), the above equation can become 

+ Wi -$i~ v ri + g, + s t (h ) + co t (t). 



(51) 



(52) 



Theorem 2. Consider the nonlinear subsystems represented by Eq. (33) and let assumptions 
hold. If choose the pseudo-control input \J/ i as Eq.(41), and use the following adaptation 
laws and robust control law 



A=F, [(S, - S' Mi A - KAi K - YwWi Vi I 
k=G l [s'JW i T i -y Wi M l \T\ 

* i =H i [s'JW i h-YwAV i \ 



*,=; 






d i = r di (T?- ;i J i \T i \), 






(53) 
(54) 
(55) 

(56) 

(57) 
(58) 
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where £* . = & mi +1 , K = F ( T > 0, G- = G. 7 > 0, H. = Hf > are any constant 
matrices, ^ • , y , . , ^. , /l, . , ^. and <Z. are positive design parameters, ^. is the estimated 
value of the uncertain disturbance term (ft. = max(/7 fy . , 8 m ) , defining 0. = fy — fy with 
(/) t error, <i. > is used to estimate unknown positive number to shield interconnection 
effect, d i is its estimated value, with d. = d { — d f estimated error, then, guarantee that all 

signals in the system are bounded and the tracking error £. will converge to a neighborhood 

of the origin. 

Proof. Consider the following positive define Lyapunov function candidate as 



The time derivative of the above equation is given by 

A = T t i t +tr(W?F^W i ) + tr(tfG- ] %)+*& H^ + rjM +/M (60) 

Applying(52) to (60), we have 



L =r. 



-k.T.+W 1 (S. -S' LI. -S' .&.) + W J OS" M +S' .dr.) 



(61) 



_+y/ i -8 i -v ri +g i +s i +co i 
+tr{WjF7'W^tr{tiG; x fo+tr{djH: x d^^^ 

Substituting the adaptive laws (53), (54) and (55) into (61), and( r J = — P) , yields 

L = v. [-k t T t +y/ i -5 j -v n + gi +s t +co i ]+y Wi \T\tr(Z*Z,) + r~$<k +r~Jd l d i 

< -ktf+T^ -S t )-V ri t t +T igi +\T t \(p A }i +S Ni ) 

+r m \T t \tiizJz t )+y-^ +r#d l d l (62) 

< -ktf+Tidr, -^)-v„r, +r,g ; . +|r,|$C 
+ y m IrltriZjZJ + y:^ + yjd i d i 
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Inserting (56) and (58) into the above inequality, we obtain 



rX 



L, < -krf +T i {y i -S t ) + T t g t +|r,.|^C -4iKi tanh(^) 



a, 



-<t> t 



,T,S 



-J,.r, 2 






r,,9 ffl ,.tanh(^^)-^>,.r,. 
a. 

- d, (r 2 - XJ, |r,. |) + r m \t, I tr(ZfZ i ) 

- dtf + r igi + 4 \z i | J,. J,. + ^,. \t, I ?r(Z, r Z,. ) 
Using (11), (63) becomes 

A ^ -K*) + ^ (^ - $ ) + $£«* - d rf + T >Si 

+ \h I \}M + 4X4 + rwMzfZt) 

By completing square, we have 



(63) 



+ 4.- k 



(pi l\TiTi 



(64) 



A ^ ~ k i T ? + h (&i -8,) + face, + 



St 



(65) 



+ \T, 



Wd + AM + rmHzfz,) 



With (41), (44), (53) -(58), approximation error between actual approximation inverse and 
ideal control inverse is bounded by 



\'J )r i-8 i \<c u +c 2i \T i \ + c 3 

where C u , C 2i , C 3 . are positive constants. 

A ^ - (*/ - c 2l ) r, 2 + |r, | (c„. + c 3t ||z. \ p ) + ^,a, 
4^4 + 4/4,4 + YwMZjZi) 



(66) 



+— +k 



(67) 



Ad, 



Since tr(Z;Z t )< Z, IZ, - Z, I ,^ < U. U . - U. ,<. < <<. K -< hold ' the 
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above inequality can be written as 



A * ~{K -c 2i )r* +|r,.|(c ll . +c 3l |z,y + ^«. + J 



+ r. 



/I 



tfvir/iir/i i' 



) + 4( 



j. 



k - 



j,. 



4J. 



) + Ml z 4I z 'Hl z l> 



By completing square for (68), we get 



(68) 



A < -(*, -c 2 ,.)r,. 2 +c 5i Kl + ^l^a,. + -^- 



(69) 



/ || || \2 

where C 5i = C u + C 4 . ,with c 4/ = -± |# | + -A | J, | + ^ ^ L 

For the overall system, it can be derived that the bound as 



^=ZA^Z]-(*,-c2iK+c5iN+W^,+^- 



i=i i=i 



(70) 



According to (34), | g .| < £ f .. | r .| = / T . , define % = [|r,|,|r 2 |,---|r„|f 

7=1 

r ,=[r,i'/ y ,-2,---r i „f / K = diag[k,-c 2l ,k 2 -c 22 ,---,k n -c 2n ] , C = [c 5l ,c 52 ,---,c 5n f 
, J) = y ( U. I <-./x ) / the above inequality can be rewritten as 



L<-Z 7 



k r,r/ 

Ad, 



Z + C T z + D = - Z T E Z + C T z + D 



(71) 



^-A mi AE)\\z\\ + \\C\\\\y\\ + D 

mm v //v KM 



where E = K — (4<i.) rP. , ^ min (E) the minimum singular value of E . ThenZ < , 
as long as k. > C 2i and sufficiently large d. , E would be positive definite, and 
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C +DA mia (E) | ||C|| _ x 



4A 2 min (E) 2A mm (E) 

>|^|,|J,|>|J,|,|Z,L>— (^,|Z,| f+C3 ,) 



Now, we define 



(72) 



Q ,=W WM> n * = M H-M 



a.=u 



d, 



<\d t \\,a z = \\\z\ 



'\ P *j-{rm\Zt\ P +Cit) { 

iWi \ 



(73) 



Since LZ. L>$- ->d i ,/ m ,C 3i are positive constants, we conclude that Q , Q Z| . , Q, 
and Q ^ are compact sets. Hence Z is negative outside these compacts set. According to a 

standard Lyapunov theorem, this demonstrates that Z. , ^. , d i and ^ are bounded and will 
converge to Q , Q z . , Q , and Q^ , respectively. Furthermore, this implies e. is bounded 
and will converge to a neighborhood of the origin and all signals in the system are bounded. 

4.1.3 Simulation Study 

In order to validate the effectiveness of the proposed scheme, we implement an example, 
and assume that the large-scale system is composed of the following two subsystems 
defined by 



Subsystem 1 ! 



-co 2 x n + 0.02(a) -x^)x v 



X 10 = -CO X n +K).K)Z{CQ-X n )X n +U { 

+ (xf { + xf 2 )a(u 1 ) + 0.2 + sin(0.2x 21 ) 



(74) 



Subsystem 2 '. < 



x 22 = x\ x + 0. 1(1 + x\ 2 )u 2 + tanh(0. \u 2 ) 
+ 0.15w 2 3 +tanh(0.1jc u ) 



(75) 



whereco = 0A7r /(J (u l ) = (\-e~ Ul )/(\ + e~ Ul )- The desired trajectory x dn = 0. 14sin(2f) - cos(7)] ' 



x d2l = 0.1;rcos(2£). 
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Input vectors of neural networks are z. = [x T , T- , \j/. ] T , / = 1, 2 , and number of hidden layer 
nodes both 8. The initial weight of neural network is W (0) = (0) . The center values and the 
widths of Gaussian function are initialized as zeroes, and v 5 , respectively. The initial 
condition of controlled plant is x { (0) = [0.1,0.2] r jc 2 (0) = [0,0] r . The other parameters are 
chosen as follows: 

A, = 5,*, = 5 r m .= 0.001, ^,.=1,^=1, ^,.=0.01, 4 =0.01 , a, .=10 , F,=10I m , 
G = 21 i ,, H = 21 , with / 9 I I corresponding identity matrices. 

Fig.5 shows the results of comparisons of tracking errors of two subsystems. Fig.6 gives 
control input of two subsystems, Fig. 7 and Fig. 8 the comparison of tracking of two 
subsystems, respectively. Fig.9 and Fig.10 illustrate outputs of two RBFNs and the change of 

norms of W,jU,d" , respectively. From these results, it can be seen that the effectiveness of the 

proposed scheme is validated, and tracking errors converge to a neighborhood of the zeroes 
and all signals in system are bounded. Furthermore, the learning rate of neural network 
controller is rapid, and can track the desired trajectory in about 1 second. From the results of 
control inputs, after shortly shocking, they tend to be smoother, and this is because neural 
networks are unknown for objective in initial stages. 




10 
time sec 



20 



Fig. 5. Tracking error of two subsystems: 1 (solid), 2(dot) 



I 




10 

time sec 



20 



Fig. 6. Control input of two subsystems: 1 (solid), 2(dot) 
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Fig. 7. Comparison of the tracking of subsystem 1: x n (solid) and X dn (dot) 
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8. Comparison of the tracking of subsystem 2: x 2l (solid) and X d2l (dot) 
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Fig. 9. Subsystem 1: Output of RBFN (solid), norms of JF (dash), ju (dot), <T (dash- 
dot) 
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10 
time sec 

Fig. 10. Subsystem 2: Output of RBFN (solid), norms of W (dash), ju (dot), & (dash- 
dot) 



4.2 RBFN-based decentralized adaptive control for the class of large-scale nonlinear 
systems with nonlinear function interconnections 

Assumption 10. The interconnection effect is bounded by the following function: 



kA>*2>-">*j|^Z"=i£/(l r yl)' 



(76) 



where £ .(| r . |) are unknown smooth nonlinear function, T ■ are filtered tracking errors to 

be defined shortly . 

The control objective is: determine a control law, force the output, y. , to follow a given 

desired output, X di , with an acceptable accuracy, while all signals involved must be 

bounded. 

Define the desired trajectory vector x di =[y di ,y di ,'-,y^ 1 ] T > X dl , = [y di , y di ,'",yf} T and 

tracking error e. = x — Xj. — \e., , e.~ , • • • , e., Y , thus, the filter tracking error can be written as 

° i i ai l i i " 1 1 " " iL J ° 



t, = [Af l]e, = k t .e t + k. 2 e t +■■■ + k,,_^ + c «-» , 



(77) 



where the coefficients are chosen such that the polynomial k n +k i2 s-\ f- k. / _ 1 ^ (/ '" 2) + s {lt ~ l) 

is Hurwitz. 

Assumption 11. The desired signal X di (f) is bounded, so that \\X d - < X ,. , withX^. a 

known constant. 

For an isolated subsystem, without interconnection function, by differentiating (77), the 

filtered tracking error can be rewritten as 
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i i =x ttl -yf+W ^■^ i =f i ix ii u i ) + Y di , (78) 



wfthr„=-^+[o a/>. 

Define a continuous function 



£.=£.r. + y,, (79) 



where A:, is a positive constant. With Assumption 5, we know df{x n U i )/du i ^ , 
thus, 3[/(x. 9 U f ) — S t ]/du. ^ . Considering the fact that dS t /du = , with the implicit 
function theorem, there exists a continuous ideal control input W. in a neighborhood 
of (x.J.)gQ. xi? , such that f(x i9 U*)-S i =0 , i.e. S,. = f.(x i9 U*) holds. 
Here, ^. = /J (x. , W • ) represents an ideal control inverse. Adding and subtracting <J. to the 
right-hand side of X. /t = /J (x. , W . ) + g . of (33), one obtains 

^ =y/(^«/)+ft+^-^-^// ( 8 °) 

and yields 

^=-V/ + yj(^»"*) + ft+< y /' ( 8i ) 

Similar to the above-mentioned equation (40), \f/ i = f t (x. , W. ) holds. 

Based on the above conditions, in order to control the system and make it be stable, we 

design the approximation pseudo-control input \j/ i as follows: 

¥ i = -k t r t - Y di - u ci - W T gi S gi (| r t |)r,. - v ri , (82) 

where U ci is output of a neural network controller, which adopts a RBFN, V ri is 

robustifying control term designed in stability analysis, W-S-(\ T t |) is used to 
compensate the interconnection nonlinearity (we will define later). 

Adding and subtracting If/, to the right-hand side of (81), with 8 { = k{T i + Y di - f. (x. , U* ) , 
we have 

i t = -k t r t + A . (x,. , u t , u* ) - u ci - W T .S. (I t, \)r t + S t - Wi - v ri + g, , (83) 



Adaptive Control for a Class of Non-affine Nonlinear Systems via Neural Networks 361 

where A . (x. , U . , U . ) — /J (^ -> U i)~ ft (X ? W • ) is error between the nonlinear function 
and its ideal control function, we can use the RBFN to approximate it. 

4.2.1 Neural network-based approximation 

Based on the approximation property of RBFN, A . (x. , U. , U. ) can be written as 

A . (x. , ii. , «.* ) = 0f S. (z. ) + f . (z. ) , (84) 

where W t is the weight vector, S t (z f ) is Gaussian basis function, 6*.(z.) is the 

approximation error and the input vector Z. G R q , q the number of input node. 

Assumption 12. The approximation error £.(z.) is bounded by | 6*. |< S m , with6* M > is 

an unknown constant. The input of the RBFN is chosen as z. = [x. , T i , \f/ i r ] . Moreover, 
output of the RBFN is designed as 

^=^.(z,.). (85) 

Define J^ as estimates of ideal W i , which are given by the RBFN tuning algorithms. 
Assumption 13. The ideal value of W i satisfies 

W i \\<W iM , (86) 

where W iM is positive known constant, with estimation errors as W i = W. — W i . 

4.2.2 Controller design and stability analysis 

Substituting (84) and (85) into (83), we have 

i, = -k^ + W'Si+St-Vt-v.+gt-Wrs^ t, \)T t +e t (z t ) (87) 

Theorem 3. Consider the nonlinear subsystems represented by Eq. (33) and let assumptions 
hold. If choose the pseudo-control input \j/ i as Eq.(82), and use the following adaptation 
laws and robust control law 

t=F i [S i T i - 7w ,W i \T i \}, (88) 
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»t=G / [5 g/ (|r,|)r, 2 -^|r / |], (89) 

it = \Xh(\ h I +l)tanh(r>,.)-^. | r, |], (90) 

v„.=^(|r,.|+l)tanh(r>,.), (91) 

where F. = F ( > , G t = G t > are any constant matrices, A , . , ^V- , 7 . , y \ . and OC i are 

positive design parameters, (j) i is the estimated value of the unknown approximation errors, 
which will be defined shortly, then, guarantee that all signals in the system are bounded and 
the tracking error e. will converge to a neighborhood of the origin. 
Proof. Consider the following positive define Lyapunov function candidate as 

2Z,. = x] + WjF7 x W i + W gl T G-% + Xftf (92) 

The time derivative of the above equation is given by 

A = hh + WjF7% + W/G-% + X~f4i (93) 

Applying (87) and(53) to (59) and (?) = -(") , we have 

L, = rt-kft+S, -y>, -v n+gl -W T gl S gl {\ r, IK. +*,] 

+r Wi t T t\^\+w gl T G-%+A;M 

Using (76), (94) is rewritten as 

L, < -ktf+T t {S t -W^-v^ + tXXjJAI Tj l)-^Xd *i IK] 
+ 1 r, | s m + A,M + rm W?W t k I + WjGr x W. 



(94) 



(95) 



Since £.. (•) is a smooth function, there exists a smooth function £..(| T ■ |) , (1 < /, j < ft) 
such that £„(| r . |) =| 7 . | Cij(\ T j I) hold - Thus ' we have 
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L < -k^+r.iS, -fc)- v rt r, +t?£T £ y Q r, \)-W T gi S gi (\ r, |)] 

' (96) 

+ 1 h | e m + Xf4i + YwWJW, | r, | +WjG~% 

Since the function d.(\ T f |) = / ,. ^ (| ^ |) ls smooth and T t is on a compact set, 
d f (\ r. |) can be approximated via a RBFN, i.e., d t (\ T i |) = W gi S gi (\ T t |) + £ gi , with 
bounded approximation error £. , | £. |< 6* M . fF . is estimate of ideal fF. , with 
boundedness 1 1 W • 1 1< ^^- a ^1m > a known constant, and the estimation errors as 
W gi = W gi - W gi . Then, (96) becomes 

+ V, I %• + $M, + YwWJW, \t, | + fT/G,- 1 ^ (97) 

< -ktf+t t (S t -V>,)-v n T i+ TfW^S gi (\ r, |) + ^.r, 2 + 1 r,. | s Ni 

+KM + rw l w*w i V\+wjG-'w gi 

Substituting the adaptive law (89), we obtain 



L <-kr 2 +t(S -y/)-v x. +s Ar r 2 +| t. I £ AT . +Ajd>A 

i ii i V i r i / n i gNi i \ i \ Ni (pi tit i 

+r m ^w i \T i \+ r wJw.\T i \ 



(98) 



Define^. = max(f M ,£ m ), with ^. is its estimate, and $ = $ —(f) i with ^. error. (98) can 
be rewritten as 

Z. <-^.r, 2 +r.(^. -yf t )-v ri T t +</> t (T- + \r,\) 
+ I'M + YwW-W, |r, | + Y p WjW. |r, | 

Applying the adaptive law (56) and robust control term (58), we have 
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L, < -k t T? + t, (8 t -y/,)- fa, (|r, | + 1) tanh(r, /a, ) + fa \r, | (\r, | + 1) 

- fa, (\t, I + 1) tanh(r ; /a, ) + yJr?W t \t, \ + y g WjW gi |r, | + lj>,fa \t, \ 

= -k,rf + t, (S i -y>,) + fa \r t | (|r. | + 1) - far, (\t, \ + 1) tanh(r, /a, ) 
+ YwWJW, |r, | + y gl WjW gi \t\ + A J, fa V, | 

= -k,T? + r. (8 t -w,) + fa (|r. | + 1) [|r, | - r, tanh(Y, /«,.)] 
+ y^ r ^ K I + r gi w/w gi \ Tl I + Aj,fa |r, I 

Using (11), we get 



+ r W T W \t \+y W T W \t 



(100) 



(101) 



m i *i 



With (82), (85), and (88)-(91), the approximation error between the ideal control inverse and 
the actual approximation inverse is bounded by | 8 i —y/ t |^ C u + C 2i \ T t \ 

+C 3 . 1 1 W i 1 1 +C 4i 1 1 W ■ 1 1 , with C u , C 2i , C 3| . , C 4 . positive constants. Moreover, we utility the 

facts, CI tf <| | tf 1 1 1 1 tf 1 1 — 1 1 5 1 1 , (101) can be rewritten as 



A^-(**-^«)^ 2 +kJ(^+^||^||+^||^||)+k*l 



^ f Wi -n 



+y . pr. \\(\\w. \\-\\w . \\) 



< - ( &. - c 2 . ) r. 2 + q . |r. | + (j) i g i a i + |r. [ 



H-^|(ki-|^|) 
+ [-^IKII 2+ (|KI + ^)|^ 



+ </>&<*, 



Completing square for (102), we have 

A ^-( k i~ c n ) ^ + % | r, | +^a, 



(102) 



(103) 
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with c 5i = Y m II W t || +c 3i ,c 6i = r gl || W gi || +c 4i , c 7 , = tf/Uj. +4/4^ +4/4^ , 



For the overall system, we have 

^ = Z"=i A ^ E"=iH*< -c 2/K + c H 1 2-,. 1 +^,.« ; ] 



(104) 



Now, define j = [|r, |,— | r„ |f , JT = diag[k l -c 2l , — ,k H -c 2n ] , C = [c sl ,c ii2 ,---,c li J T , 
D = ^" {<f>.q.a ) ■ (104) can be rewritten as 



L<- Z T K % + C T Z + D<-A mm (K)\\ Z \\ 2 +\\C\\\\ Z \\+D 

By completing square, yields 



L<-^(K) 



L/C 



c 






^ffl 



- + D 



2-UWy 
Clearly, L < , as long as &. > C 2i , and 

1*1*4 U|>vi4 IKN^IKI, Wl *c 6ir -J Ik,. 



(105) 



(106) 



(107) 



where A = > /[jjcjf + Z>>^ (tf)]/^ (*)] +|C|/[2A min (^)] with ^ (^) the minimum 



singular value of AT . 
Now, we define 



n*=M IHN4^=M| I^N'W}' 



Q,,=M 



-JF&- 



ftII<c r _1 IIjtIII q = IIIft II IIjt ll<c r _1 ll^ III 



(108) 



Since |^|,|^|,^,^,^,^,C 5 pC w are positive constants, we conclude 

that Q , Q / , Oj^. and Q Ws . are compact sets. Hence L is negative outside these 
compacts set. According to a standard Lyapunov theorem, this demonstrates that 
W.,W -,^. and jjf are bounded and will converge to Q , Q, , Q^. and £2^. , 
respectively. 
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Furthermore, this implies £. is bounded and will converge to a neighborhood of the origin 
and all signals in the system are bounded. 

4.2.3 Simulation Study 

In order to validate the effectiveness of the proposed scheme, we implement an example, 
and assume that the large-scale system is composed of the following two subsystems 
defined by 



Subsystem 1 ! < 



X ll — X 12 



x 12 = -co 2 x n + 0.02(a) -x l \)x l2 +u x + (x { \ + xl 2 )<j(u x ) 

+ 0.1||x 2 ||exp(0.5||x 2 1|) 



(109) 



Subsystem 2 ! 



x 22 = x 21 + 0.1(l + x 22 )i/ 2 + tanh(0.k/ 2 ) + 0.15t/ 2 

+ 0.2 || x 2 || exp(0.1|| x 2 ||) 



(110) 



where co = 0A7r , a(u Y ) = (l-e~ Ul )/(l-\-e~ U{ ) • The desired trajectory 
x dn = 0.1;r[sin(20-cos(7)]' x d2l = 0.1;rsin(20 . For the RBFNs as (84), input vectors are 
chosen asz. = [x. ,r.,^.] , i = 1,2 and number of hidden layer nodes both 8, the initial 

weights W i (0) = (0) and the center values and the widths of Gaussian function zero, and 2, 
respectively. For the RBFNs, which used to compensate the interconnection nonlinearities, 
both input vectors are [r p r 2 ] , number of hidden layer nodes is 8, the initial 

weights W ■ (0) = (0) , and the center values and the widths of Gaussian function zero, 

and V5 / respectively. The initial condition of controlled plant is x { (0) = [0.2,0.2] r , 

x 2 (0) = [0.3,0.2] r • The other parameters are chosen as follows: A. =1,A:. =2 , 

^=0.001,^. =0.1,^=0.01, a, =10 , F t =l0I m ,G = 2I gi , with I m ,I gi 

corresponding identity matrices. Fig.ll and 12 show the results of comparisons of tracking 
errors and control input of two subsystems, Fig.13 and 14 the comparison of tracking of two 
subsystems, respectively. Fig.15 and Fig.16 illustrate the norm of the four weights in two 
subsystems, respectively. From these results, it can be seen that the effectiveness of the 
proposed scheme is validated, and tracking errors converge to a neighborhood of the zeroes 
and all signals in system are bounded. Furthermore, the learning rate of neural network 
controller is rapid, and can track the desired trajectory in less than 3 seconds. From the 
results of control inputs, after shortly shocking, they tend to be smoother, and this is 
because neural networks are unknown for objective in initial stages. As desired, though the 
system is complex, the whole running process is well. 
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Fig. 12. Control input of subsystem!: U x , and subsystem 2: W 2 
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Fig. 14. Comparion of tracking of subsystem 2 
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Fig. 15. The norms of weights and output of RBFNof subsystem 1 
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Fig. 16. The norms of weights and output of RBFNof subsystem 2 



5. Conclusion 

In this chapter, first, a novel design ideal has been developed for a general class of nonlinear 
systems, which the controlled plants are a class of non-affine nonlinear implicit function and 
smooth with respect to control input. The control algorithm bases on some mathematical 
theories and Lyapunov stability theory. In order to satisfy the smooth condition of these 
theorems, hyperbolic tangent function is adopted, instead of sign function. This makes 
control signal tend smoother and system running easier. Then, the proposed scheme is 
extended to a class of large-scale interconnected nonlinear systems, which the subsystems 
are composed of the above-mentioned class of non-affine nonlinear functions. For two 
classes of interconnection function, two RBFN-based decentralized adaptive control schemes 
are proposed, respectively. Using an on-line approximation approach, we have been able to 
relax the linear in the parameter requirements of traditional nonlinear decentralized 
adaptive control without considering the dynamic uncertainty as part of the 
interconnections and disturbances. The theory and simulation results show that the neural 
network plays an important role in systems. The overall adaptive schemes are proven to 



Adaptive Control for a Class of Non-affine Nonlinear Systems via Neural Networks 369 



guarantee uniform boundedness in the Lyapunov sense. The effectiveness of the proposed 
control schemes are illustrated through simulations. As desired, all signals in systems, 
including control signals, are tend to smooth. 
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Appendix A 

As Eq.(19), the approximation error of function can be written as 

M T cr-M T a = M T cr-M T (i+M T cr-M T a = M T (cr-&)+M T a 
Substituting (18) into the above equation, we have 

M t (<j-<7) + M t <j 

= M T [a + a'N T x m + 0(N T xJ 2 ]+M T [a'N T x im +0(N T x m f] 

= M T <7+M T <7'N T x +M T a'N T x +M t O(N t x f 

nn nn v nn ' 

= M T a+M T a'N T x m -M T a'N T x m +M T &'N T x nn +M t O(N t xJ 2 
= M T (a- S-'N T x m ) + M T a'N T x nn + M T a'N T x m + M T 0(N T x nn f 

Define that 



a = M 1 a'N' x m +M' 0(N' xY 

nn v nn' 
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so that 



M T a-M T a = M T (a-a'N T x )+M T a'N T x +cd 

\ nn-' nn 



Thus, 



co = M T a-M T a-M T (a-aN T x)-M T aN T x„„ 

v nn / nn 

= M r a-M r 6-+M T <j'N T x im -M T a'N T x im 

nn nn 

= M T (a-a)+M T a'N T x m -M T &N T x m -M T a'N T x m 
= M T (cr-a)+M T ( 7'N T x m -M T a'N T x m 



Appendix B 

Using (46) and (47), the function approximation error can be written as 

n?s t - w[s t = wi r s i - w*s t + wfs, - w*s t = wfs, + w*s t 
= w[[s t +5;a +s' a A +o(ju i ,a i f]+w^s i +s'^ i +&, +oiAAf -s t ] 

= W^S t +W'[S' Ml (U -A)+4(oi -&,)]+#?&% +S' <7i a i )+W^O( J u i ,a i ) 2 

= w?(§, -i;A -s'^+w^s'^ +s' m .c7 i ) + w^s' Mi fi l +s' ai a i ) + w^o(M i A) 2 



define as 



^^(S'u+S'^+WjOifi^) 2 



Thus, 
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co t (t) = w?s, +w?s t -w?(s, -£;,a -s'^-w^s'^+KA,) 
=w?s i +^(^,A +KA,)-w?%ju, +KA) 



